SYNTHESIS OF GLYCOPROTEINS USING BACTERIAL
GLYCOSYLTRANSFERASES



CROSS-REFERENCES TO RELATED APPLICATIONS 
[0001] This application claims the benefit of U.S. Provisional Application No. 60/398,156,
filed July 23, 2002, and U.S. Provisional Application 60/424,894, filed November 8, 2002; 
both of which are herein incorporated by reference for all purposes. 



[0002] This invention provides nucleic acid and amino acid sequences of
fucosyltransferases from Helicobacter pylori. The invention also provides methods to use
the fucosyltransferases to synthesize oligosaccharides, glycoproteins, and glycolipids.



[0003] Although in recent years significant advances have been made in carbohydrate
chemistry, there are still substantial difficulties associated with the chemical synthesis of
glycoconjugates, particularly with the formation of the ubiquitous β-1,2-cis-mannoside
linkage found in mammalian oligosaccharides. Moreover, regio- and stereo-chemical
obstacles must be resolved at each step of the de novo synthesis of a carbohydrate.

[0004] In view of the difficulties associated with the chemical synthesis of 
glycoconjugates, the use of glycosyltransferases to enzymatically synthesize glycoproteins 
and glycolipids, having desired oligosaccharide moieties, is a promising approach to 
preparing such glycoconjugates. Enzyme-based syntheses have the advantages of 
regioselectivity and stereoselectivity, and can be performed using unprotected substrates. 
Moreover, glycosyltransferases have been used to enzymatically modify oligosaccharide 
moieties and have been shown to be very effective for producing specific products with good 
stereochemical and regiochemical control. The glycosyltransferases of interest include
fucosyltransferases, sialyltransferases, galactosyltransferases, and
N-acetylglucosaminyltransferases. For a general review, see, Crout et al., Curr. Opin. Chem.
Biol. 2: 98-111 (1998) and Arsequell et al., Tetrahedron: Asymmetry 10: 2839 (1997).

[0005] Many glycoproteins and glycolipids require the presence of a particular glycoform,
or the absence of a particular glycoform, in order to exhibit a particular biological activity.
For example, many glycoproteins and glycolipids require the presence of particular
fucosylated structures in order to exhibit biological activity. Intercellular recognition
mechanisms often require a fucosylated oligosaccharide. For example, a number of
glycoproteins that function as cell adhesion molecules, including P-selectin, L-selectin, and
E-selectin, bind specific cell surface fucosylated carbohydrate structures such as the sialyl
Lewis-x and the sialyl Lewis-a structures. In addition, the specific carbohydrate structures
that form the ABO blood group system are fucosylated. The carbohydrate structures in each
of the three groups share a Fucα1,2Galβ1-disaccharide unit. In blood group O structures, this
disaccharide is the terminal structure; whereas the blood group A structure is formed by an
α1,3 GalNAc transferase that adds a terminal GalNAc residue to the disaccharide; and the
blood group B structure is formed by an α1,3 galactosyltransferase that adds a terminal
galactose residue.

[0006] The Lewis blood group structures are also fucosylated. For example, the Lewis-x
and Lewis-a structures are Galβ1,4(Fucα1,3)GlcNAc and Galβ1,3(Fucα1,4)GlcNAc,
respectively. Both these structures can be further sialylated (NeuAcα2,3-) to form the
corresponding sialylated structures. Other Lewis blood group structures of interest are the
Lewis-y and Lewis-b structures, which are Fucα1,2Galβ1,4(Fucα1,3)GlcNAcβ-OR and
Fucα1,2Galβ1,3(Fucα1,4)GlcNAc-OR, respectively. For a description of the structures of
the ABO and Lewis blood group structures and the enzymes involved in their synthesis see,
Essentials of Glycobiology, Varki et al. eds., Chapter 16 (Cold Spring Harbor Press, Cold
Spring Harbor, NY, 1999).

[0007] Specifically, fucosyltransferases have been used in synthetic pathways to transfer a
fucose residue from guanosine-5'-diphosphofucose to a specific hydroxyl of a saccharide
acceptor. A variety of donor substrates and acceptor substrates are known (see Guo et al.,
Applied Biochem. and Biotech. 68: 1-20 (1997)). For example, Ichikawa prepared sialyl
Lewis-x by a method that involves the fucosylation of sialylated lactosamine with a cloned
fucosyltransferase (Ichikawa et al., J. Am. Chem. Soc. 114: 9283-9298 (1992)). Lowe has
described a method for expressing non-native fucosylation activity in cells, thereby producing
fucosylated glycoproteins on cell surfaces, etc. (U.S. Patent No. 5,955,347).

[0008] Thus, since the biological activity of many commercially important recombinantly
and transgenically produced glycoproteins and glycolipids depends upon the presence of a
particular glycoform, or the absence of a particular glycoform, a need exists for an efficient
method for enzymatically synthesizing glycoconjugates having the desired fucosylated
oligosaccharide moieties. In addition, there is a need for the efficient production of
fucosylated oligosaccharides. The present invention fulfills these and other needs.

BRIEF SUMMARY OF THE INVENTION

[0009] The present invention provides α-1,3/4-fucosyltransferase proteins and nucleic acids
from H. pylori. The α-1,3/4-fucosyltransferase proteins catalyze the transfer of a fucose
residue from a donor substrate to an acceptor substrate. In one embodiment, the invention
provides α-1,3/4-fucosyltransferase nucleic acids with greater than 90% identity to a
nucleotide sequence selected from SEQ ID NO:1, 3, or 7 and that encode
α-1,3/4-fucosyltransferase proteins that transfer fucose to GlcNAc residues. In another
embodiment, the invention provides α-1,3/4-fucosyltransferase nucleic acids with greater
than 90% identity to SEQ ID NO:5 and that encode α-1,3/4-fucosyltransferase proteins that
transfer fucose to glucose residues.

[0010] In another embodiment the α-1,3/4-fucosyltransferase nucleic acid is selected from
SEQ ID NO:1, 3, 5 or 7. The invention also provides nucleic acid sequences that encode
α-1,3/4-fucosyltransferase proteins, including SEQ ID NO:2, 4, 6, or 8 and that catalyze the
transfer of fucose to an N-acetylglucosamine residue or to a glucose residue. In one aspect
the encoded α-1,3/4-fucosyltransferase also includes an amino acid tag.

[0011] In a further aspect the invention provides an isolated nucleic acid that includes SEQ
ID NO:11, and that encodes an α-1,3/4-fucosyltransferase protein that catalyzes the transfer
of a fucose residue from a donor substrate to a glucose residue. In another aspect the
invention provides a nucleic acid that encodes SEQ ID NO:12.

[0012] In another embodiment the invention provides expression vectors that include the
above described α-1,3/4-fucosyltransferase nucleic acids, host cells that include the expression
vectors, and methods to produce the α-1,3/4-fucosyltransferase proteins using the host cells
cultured under conditions suitable for expression of the α-1,3/4-fucosyltransferase protein.

[0013] In another embodiment the invention provides recombinant fucosyltransferase
proteins that include an amino acid sequence having greater than 90% identity to SEQ ID NO:2,
4, or 8, wherein the fucosyltransferase catalyzes the transfer of a fucose residue from a donor
substrate to N-acetylglucosamine. In another embodiment the invention provides
recombinant fucosyltransferase proteins that include an amino acid sequence having greater than
90% identity to SEQ ID NO:6, wherein the fucosyltransferase catalyzes the transfer of a
fucose residue from a donor substrate to glucose. In one aspect, the fucosyltransferase
proteins comprise SEQ ID NO:2, 4, 6, or 8. In another aspect the fucosyltransferase proteins
also include an amino acid tag.

[0014] In another embodiment the invention provides recombinant fucosyltransferase
proteins that include SEQ ID NO:12, and that catalyze the transfer of a fucose residue from
a donor substrate to glucose. In another aspect the fucosyltransferase proteins also include an
amino acid tag.

[0015] The present invention also provides methods to use the above
α-1,3/4-fucosyltransferase protein to produce fucosylated oligosaccharides. The fucosylated
oligosaccharides can be further purified. The acceptor substrate can be either
N-acetylglucosamine or glucose depending on the needs of the user. In one embodiment the
acceptor substrate is Lacto-N-neo-Tetraose (LNnT) and the fucosylated product is
Lacto-N-Fucopentaose III (LNFP III). The α-1,3/4-fucosyltransferase can be used in combination
with other glycosyltransferases to produce a fucosylated oligosaccharide. For example, using
lactose as a starting material, LNFP can be produced through the action of an
α-1,3/4-fucosyltransferase that transfers fucose to N-acetylglucosamine, a
β-1,3-N-acetylglucosaminyltransferase, and a β-1,4-galactosyltransferase. The
β-1,3-N-acetylglucosaminyltransferase and the β-1,4-galactosyltransferase can be bacterial enzymes
and in a preferred embodiment are from Neisseria gonococcus.

[0016] In another embodiment, the α-1,3/4-fucosyltransferase proteins of the present
invention are used to produce fucosylated glycolipids. The acceptor substrate can be either
N-acetylglucosamine or glucose depending on the needs of the user.

[0017] In another embodiment, the present invention provides a method for producing a
fucosylated glycoprotein, by combining an α-1,3/4-fucosyltransferase described herein with a
glycoprotein that includes an appropriate acceptor substrate under conditions suitable to
produce a fucosylated glycoprotein. The acceptor substrate can be selected from Galβ1-OR,
Galβ1,3/4GlcNAc-OR, and NeuAcα2,3Galβ1,3/4GlcNAc-OR, wherein R is an amino acid, a
saccharide, an oligosaccharide, or an aglycon group having at least one carbon atom. The
acceptor substrate can be an N-acetylglucosamine residue or a glucose residue. The
α-1,3/4-fucosyltransferase can also include an amino acid tag.

BRIEF DESCRIPTION OF THE DRAWINGS
[0018] Figure 1 provides the nucleic acid and amino acid sequences of fucosyltransferase
from H. pylori strain 1182B.

[0019] Figure 2 provides the nucleic acid and amino acid sequences of fucosyltransferase
from H. pylori strain 1111A.

[0020] Figure 3 provides the nucleic acid and amino acid sequences of fucosyltransferase
from H. pylori strain 1218B.

[0021] Figure 4 provides the nucleic acid and amino acid sequences of fucosyltransferase
from H. pylori strain 19C2B.

[0022] Figure 5 provides the nucleic acid and amino acid sequences of fucosyltransferase
from H. pylori strain 915A.

[0023] Figure 6 provides the nucleic acid and amino acid sequences of fucosyltransferase
from H. pylori strain 26695A.

[0024] Figure 7 provides the nucleic acid and amino acid sequences of fucosyltransferase
from H. pylori strain 19C2A.

[0025] Figure 8 provides an alignment between 1182 futB amino acid sequence and a
consensus sequence from the glycosyltransferase family 10, i.e., the fucosyltransferase
family. Amino acids 23 through 305 of 1182 futB are shown in the top line and represent the
most conserved region of the protein, i.e., the fucosyltransferase catalytic domain.

[0026] Figure 9 provides an alignment between 1111 futA amino acid sequence and a
consensus sequence from the glycosyltransferase family 10, i.e., the fucosyltransferase
family. Amino acids 27 through 417 of 1182 futB are shown in the top line and represent the
most conserved region of the protein, i.e., the fucosyltransferase catalytic domain.

[0027] Figure 10 provides an alignment between 1218 futB amino acid sequence and a
consensus sequence from the glycosyltransferase family 10, i.e., the fucosyltransferase
family. Amino acids 23 through 399 of 1182 futB are shown in the top line and represent the
most conserved region of the protein, i.e., the fucosyltransferase catalytic domain.

[0028] Figure 11 provides an alignment between 19C2 futB amino acid sequence and a
consensus sequence from the glycosyltransferase family 10, i.e., the fucosyltransferase
family. Amino acids 23 through 377 of 1182 futB are shown in the top line and represent the
most conserved region of the disclosed protein, i.e., the fucosyltransferase catalytic domain.

[0029] Figure 12 provides an alignment between amino acid sequences of H. pylori strains
1182 FutB, 1111 FutA, 1218 FutB, 19C2 FutB, 915 FutA, 19C2 FutA, and 26695 FutA. The
bottom sequence is a consensus sequence.

[0030] Figure 13 provides an alignment between nucleic acid sequences of H. pylori strains
1182 FutB, 1111 FutA, 1218 FutB, 19C2 FutB, 915 FutA, 19C2 FutA, and 26695 FutA. The
bottom sequence is a consensus sequence.

[0031] Figure 14 provides oligosaccharide structures of Lacto-N-neo-Tetraose (LNnT), a
substrate of the H. pylori fucosyltransferases, and Lacto-N-Fucopentaose III (LNFPIII or
LNFIII), a product of the H. pylori fucosyltransferases.

[0032] Figure 15 provides the results of analysis of acceptor specificity for the H. pylori
fucosyltransferases.

[0033] Figure 16 provides the yield of LNFIII synthesis using the H. pylori
fucosyltransferases. Two ion exchange resins were tested: MR3 NH4HCO3 and
Dowex1/Dowex50 resin.

[0034] Figure 17 demonstrates the use of FutB α-1,3/4-fucosyltransferase from H. pylori
strain 1182 to transfer fucose to the glycoprotein asialyltransferrin. The upper panel shows
GC/MS analysis of sialylated transferrin. The lower panel shows GC/MS analysis of
sialylated transferrin that has been enzymatically asialylated and then fucosylated using H.
pylori strain 1182 FutB α-1,3/4-fucosyltransferase. Key to sugar structures: filled squares,
GlcNAc; open circles, mannose; filled diamonds, galactose; triangles, fucose; stars, sialic acid.

DEFINITIONS 

[0035] Unless defined otherwise, all technical and scientific terms used herein generally
have the same meaning as commonly understood by one of ordinary skill in the art to which
this invention belongs. Generally, the nomenclature used herein and the laboratory
procedures in cell culture, molecular genetics, organic chemistry and nucleic acid chemistry
and hybridization described below are those well known and commonly employed in the art.
Standard techniques are used for nucleic acid and peptide synthesis. Generally, enzymatic
reactions and purification steps are performed according to the manufacturer's specifications.
The techniques and procedures are generally performed according to conventional methods in
the art and various general references (see generally, Sambrook et al., Molecular Cloning:
A Laboratory Manual, 2d ed. (1989) Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y., which is incorporated herein by reference), which are provided throughout this
document. The nomenclature used herein and the laboratory procedures in analytical
chemistry and organic synthesis described below are those well known and commonly
employed in the art. Standard techniques, or modifications thereof, are used for chemical
syntheses and chemical analyses.

[0036] The terms "α-1,3/4-fucosyltransferase or fucosyltransferase" or a nucleic acid
encoding an "α-1,3/4-fucosyltransferase or fucosyltransferase" refer to nucleic acid and
polypeptide polymorphic variants, alleles, mutants, and interspecies homologs that: (1) have
an amino acid sequence that has greater than about 60% amino acid sequence identity, 65%,
70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%
or greater amino acid sequence identity, preferably over a region of at least about 25, 50, 100,
200, 500, 1000, or more amino acids, to a polypeptide encoded by a nucleic acid selected
from SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, or SEQ ID
NO:13; or an amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID
NO:8, SEQ ID NO:10, or SEQ ID NO:14; (2) specifically bind to antibodies, e.g., polyclonal
antibodies, raised against an immunogen comprising an amino acid sequence of SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, or SEQ ID NO:14;
immunogenic fragments thereof, and conservatively modified variants thereof; (3)
specifically hybridize under stringent hybridization conditions to a nucleic acid encoding
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, or SEQ ID
NO:14; e.g., a nucleic acid sequence of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ
ID NO:7, SEQ ID NO:9, or SEQ ID NO:13; or its complement, and conservatively modified
variants thereof; (4) have a nucleic acid sequence that has greater than about 90%, preferably
greater than about 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher nucleotide
sequence identity, preferably over a region of at least about 25, 50, 100, 200, 500, 1000, or
more nucleotides, to SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID
NO:9, or SEQ ID NO:13; or its complement. The nucleic acids and proteins of the invention
include both naturally occurring and recombinant molecules.
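
As an illustration of the sequence identity thresholds recited above, the following Python sketch computes percent identity between two sequences that have already been aligned. The function name, the use of "-" as the gap character, and the choice to count only ungapped columns are assumptions made for illustration; the specification does not prescribe this particular calculation, and other conventions (for example, dividing by the full alignment length) give different values.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the ungapped columns of a pairwise alignment.

    Both inputs are assumed to be pre-aligned, equal-length strings in which
    "-" marks a gap (an illustrative convention, not taken from the text).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0  # columns where neither sequence has a gap
    matches = 0   # compared columns with identical residues
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" or res_b == "-":
            continue
        compared += 1
        if res_a.upper() == res_b.upper():
            matches += 1

    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if identity is strictly greater than the threshold percentage."""
    return percent_identity(aligned_a, aligned_b) > threshold
```

For instance, two aligned ten-residue stretches that differ at a single position have 90% identity by this measure and would not satisfy a "greater than 90%" criterion.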

[0037] The α-1,3/4-fucosyltransferase enzymes of the invention can also be recognized by
the presence of highly conserved catalytic domains that are found in a family of
fucosyltransferase proteins, glycosyltransferase family 10, see e.g., gnl|CDD|16836
pfam00852, Glyco_transf_10. Alignments between conserved catalytic domains of 1182
futB, 1111 futA, 1218 futB, and 19C2 futB and a consensus sequence from the catalytic
domain of glycosyltransferase family 10 members are shown in Figures 8-11.

[0038] A biologically active fucosyltransferase as described herein is a fucosyltransferase
that catalyzes the transfer of fucose from a donor substrate, for example, GDP-fucose, to an
acceptor molecule in an α-1,3/4-linkage. The acceptor molecule can be either
N-acetylglucosamine (GlcNAc) or glucose. For example, fucosyltransferases from the
following H. pylori strains transfer fucose to GlcNAc: Strain 915 FutA, Strain 1111 FutA,
Strain 19C2 FutB, and Strain 1182 FutB. The FutA gene product from H. pylori strain 19C2
transfers fucose to the reducing glucose of the LNnT acceptor, as did the FutB gene
product from H. pylori strain 1218, and a novel 26695 FutA protein. In preferred
embodiments, the fucosyltransferase transfers fucose exclusively to GlcNAc or exclusively
to glucose. The acceptor molecule can be a carbohydrate, an oligosaccharide, a glycolipid, or
a glycoprotein.

[0039] The H. pylori fucosyltransferase proteins of the invention are useful for transferring
a saccharide from a donor substrate to an acceptor substrate. The addition generally takes
place at the non-reducing end of an oligosaccharide or carbohydrate moiety on a biomolecule.
However, in some embodiments the fucose residue is added to a reducing glucose residue.
Biomolecules as defined here include but are not limited to biologically significant molecules
such as carbohydrates, oligosaccharides, proteins (e.g., glycoproteins), and lipids (e.g.,
glycolipids, phospholipids, sphingolipids and gangliosides).

[0040] The following abbreviations are used herein:

Ara = arabinosyl;
Fru = fructosyl;
Fuc = fucosyl;
Gal = galactosyl;
GalNAc = N-acetylgalactosylamino;
Glc = glucosyl;
GlcNAc = N-acetylglucosylamino;
Man = mannosyl; and
NeuAc = sialyl (N-acetylneuraminyl)
FT or Fut = fucosyltransferase*
ST = sialyltransferase*
GalT = galactosyltransferase*

[0041] Oligosaccharides are considered to have a reducing end and a non-reducing end,
whether or not the saccharide at the reducing end is in fact a reducing sugar. In accordance
with accepted nomenclature, oligosaccharides are depicted herein with the non-reducing end
on the left and the reducing end on the right.

[0042] All oligosaccharides described herein are described with the name or abbreviation
for the non-reducing saccharide (e.g., Gal), followed by the configuration of the glycosidic
bond (α or β), the ring bond, the ring position of the reducing saccharide involved in the
bond, and then the name or abbreviation of the reducing saccharide (e.g., GlcNAc). The
linkage between two sugars may be expressed, for example, as 2,3, 2→3, or (2,3). Each
saccharide is a pyranose or furanose.
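
To make the naming convention concrete, the following Python sketch splits a single disaccharide linkage written in the comma form (for example, Galβ1,4GlcNAc) into its parts. The `Linkage` type, the `parse_linkage` function, and the regular expression are hypothetical helpers introduced only for illustration; they handle one unbranched linkage in the "x,y" form and are not a general glycan parser.

```python
import re
from typing import NamedTuple


class Linkage(NamedTuple):
    non_reducing: str    # saccharide on the left, e.g. "Gal"
    anomer: str          # configuration of the glycosidic bond, "α" or "β"
    from_position: int   # ring position on the non-reducing saccharide
    to_position: int     # ring position on the reducing saccharide
    reducing: str        # saccharide on the right, e.g. "GlcNAc"


# name, anomer (α or β), position, comma, position, name
_LINKAGE = re.compile(r"^([A-Za-z0-9]+?)([αβ])(\d),(\d)([A-Za-z0-9]+)$")


def parse_linkage(text: str) -> Linkage:
    """Parse one linkage such as "Galβ1,4GlcNAc" into its components."""
    match = _LINKAGE.match(text)
    if match is None:
        raise ValueError(f"not a simple linkage in 'x,y' form: {text!r}")
    name, anomer, pos_from, pos_to, reducing = match.groups()
    return Linkage(name, anomer, int(pos_from), int(pos_to), reducing)
```

Under these assumptions, `parse_linkage("NeuAcα2,3Gal")` yields a non-reducing NeuAc joined through an α bond from its position 2 to position 3 of Gal.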

[0043] The term "sialic acid" refers to any member of a family of nine-carbon carboxylated
sugars. The most common member of the sialic acid family is N-acetyl-neuraminic acid
(2-keto-5-acetamido-3,5-dideoxy-D-glycero-D-galactononulopyranos-1-onic acid (often
abbreviated as Neu5Ac, NeuAc, or NANA)). A second member of the family is
N-glycolyl-neuraminic acid (Neu5Gc or NeuGc), in which the N-acetyl group of NeuAc is hydroxylated.
A third sialic acid family member is 2-keto-3-deoxy-nonulosonic acid (KDN) (Nadano et al.
(1986) J. Biol. Chem. 261: 11550-11557; Kanamori et al., J. Biol. Chem. 265: 21811-21819
(1990)). Also included are 9-substituted sialic acids such as a 9-O-C1-C6 acyl-Neu5Ac like
9-O-lactyl-Neu5Ac or 9-O-acetyl-Neu5Ac, 9-deoxy-9-fluoro-Neu5Ac and
9-azido-9-deoxy-Neu5Ac. For review of the sialic acid family, see, e.g., Varki, Glycobiology 2: 25-40 (1992);
Sialic Acids: Chemistry, Metabolism and Function, R. Schauer, Ed. (Springer-Verlag, New
York (1992)). The synthesis and use of sialic acid compounds in a sialylation procedure is
disclosed in international application WO 92/16640, published October 1, 1992.

[0044] An "acceptor substrate" for a glycosyltransferase is an oligosaccharide moiety that
can act as an acceptor for a particular glycosyltransferase. When the acceptor substrate is
contacted with the corresponding glycosyltransferase and sugar donor substrate, and other
necessary reaction mixture components, and the reaction mixture is incubated for a sufficient
period of time, the glycosyltransferase transfers sugar residues from the sugar donor substrate
to the acceptor substrate. The acceptor substrate will often vary for different types of a
particular glycosyltransferase.

[0045] An "acceptor substrate" for an H. pylori fucosyltransferase is an oligosaccharide
moiety that can act as an acceptor for the H. pylori fucosyltransferase. When the acceptor
substrate is contacted with the H. pylori fucosyltransferase and sugar donor substrate (e.g.,
GDP-fucose), and other necessary reaction mixture components, and the reaction mixture is
incubated for a sufficient period of time, the H. pylori fucosyltransferase transfers fucose
residues from the GDP-fucose to the acceptor substrate. The acceptor substrate will often
vary for different types of a particular fucosyltransferase. For example, the acceptor
substrate for a mammalian galactoside 2-L-fucosyltransferase (α1,2-fucosyltransferase) will
include a Galβ1,4-GlcNAc-R at a non-reducing terminus of an oligosaccharide; this
fucosyltransferase attaches a fucose residue to the Gal via an α1,2 linkage. Terminal
Galβ1,4-GlcNAc-R and Galβ1,3-GlcNAc-R and sialylated analogs thereof are acceptor
substrates for α1,3- and α1,4-fucosyltransferases, respectively. These enzymes, however,
attach the fucose residue to the GlcNAc residue of the acceptor substrate. Accordingly, the
term "acceptor substrate" is taken in context with the particular glycosyltransferase of interest
for a particular application. The H. pylori fucosyltransferases described herein will transfer
fucose to sialylated or unsialylated acceptor substrates. Some H. pylori fucosyltransferases
described herein will transfer fucose to glucose residues.

[0046] A "donor substrate" for glycosyltransferases is an activated nucleotide sugar. Such 
activated sugars generally consist of uridine, guanosine, and cytidine monophosphate 
derivatives of the sugars (UMP, GMP and CMP, respectively) or diphosphate derivatives of 
the sugars (UDP, GDP and CDP, respectively) in which the nucleoside monophosphate or 
diphosphate serves as a leaving group. For example, a donor substrate for fucosyltransferases 
is GDP-fucose. Donor substrates for sialyltransferases, for example, are activated sugar 
nucleotides comprising the desired sialic acid. For instance, in the case of NeuAc, the 
activated sugar is CMP-NeuAc. 

[0047] A "substantially uniform glycoform" or a "substantially uniform glycosylation
pattern," when referring to a glycoprotein species, refers to the percentage of acceptor
substrates that are glycosylated by the glycosyltransferase of interest (e.g.,
fucosyltransferase). For example, in the case of the α1,3 or α1,4 fucosyltransferase noted
above, a substantially uniform fucosylation pattern exists if substantially all (as defined
below) of the Galβ1,4-GlcNAc-R and sialylated or unsialylated analogues thereof are
fucosylated in a composition comprising the glycoprotein of interest. It will be understood
by one of skill in the art that the starting material may contain glycosylated acceptor
substrates (e.g., fucosylated Galβ1,4-GlcNAc-R substrates). Thus, the calculated amount of
glycosylation will include acceptor substrates that are glycosylated by the methods of the
invention, as well as those acceptor substrates already glycosylated in the starting material.

[0048] The term "substantially" in the above definitions of "substantially uniform"
generally means at least about 60%, at least about 70%, at least about 80%, or more
preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%
of the acceptor substrates for a particular glycosyltransferase are glycosylated (e.g.,
fucosylated Galβ1,4-GlcNAc-R substrates).
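
The arithmetic behind the "substantially uniform" definition can be sketched as follows. This is a minimal Python illustration in which the function names, the site counts, and the default 90% cutoff are assumptions chosen for the example; per the definition above, acceptor substrates already glycosylated in the starting material are counted together with those glycosylated by the method.

```python
def glycosylated_fraction(total_sites: int,
                          already_glycosylated: int,
                          newly_glycosylated: int) -> float:
    """Fraction of acceptor sites that carry the sugar after the reaction.

    Sites glycosylated in the starting material count toward the total,
    alongside sites glycosylated by the enzymatic step.
    """
    if total_sites <= 0:
        raise ValueError("total_sites must be positive")
    glycosylated = already_glycosylated + newly_glycosylated
    if glycosylated > total_sites:
        raise ValueError("glycosylated sites cannot exceed total sites")
    return glycosylated / total_sites


def is_substantially_uniform(fraction: float, cutoff: float = 0.90) -> bool:
    """True if the glycosylated fraction is at least the chosen cutoff."""
    return fraction >= cutoff
```

With 100 acceptor sites, 10 fucosylated in the starting material and 85 fucosylated by the enzyme, the fraction is 0.95, which satisfies a 90% cutoff but not a 96% cutoff.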

[0049] The term "substantially identical fucosylation pattern," refers to a glycosylation
pattern of a glycoprotein produced by a method of the invention which is at least about 80%,
more preferably at least about 90%, even more preferably at least about 91%, 92%, 93%,
94%, or 95% and still more preferably at least about 96%, 97%, 98% or 99% identical to the
fucosylation of a known glycoprotein. "Known fucosylation pattern," refers to a fucosylation
pattern of a known glycoprotein from any source having any known level of fucosylation.

[0050] The term "amino acid" refers to naturally occurring and synthetic amino acids, as
well as amino acid analogs and amino acid mimetics that function in a manner similar to the
naturally occurring amino acids. Naturally occurring amino acids are those encoded by the
genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have
the same basic chemical structure as a naturally occurring amino acid, i.e., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified
R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical
structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical
compounds that have a structure that is different from the general chemical structure of an
amino acid, but that function in a manner similar to a naturally occurring amino acid.

[0051] "Protein", "polypeptide", or "peptide" refer to a polymer in which the monomers are
amino acids and are joined together through amide bonds, alternatively referred to as a
polypeptide. When the amino acids are α-amino acids, either the L-optical isomer or the
D-optical isomer can be used. Additionally, unnatural amino acids, for example, β-alanine,
phenylglycine and homoarginine are also included. Amino acids that are not gene-encoded
may also be used in the present invention. Furthermore, amino acids that have been modified
to include reactive groups may also be used in the invention. All of the amino acids used in
the present invention may be either the D- or L-isomer. The L-isomers are generally
preferred. In addition, other peptidomimetics are also useful in the present invention. For a
general review, see, Spatola, A. F., in Chemistry and Biochemistry of Amino Acids,
Peptides and Proteins, B. Weinstein, ed., Marcel Dekker, New York, p. 267 (1983).

WO 2004/009793    PCT/US2003/023155

[0052] The term "recombinant" when used with reference to a cell indicates that the cell
replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a
heterologous nucleic acid. Recombinant cells can contain genes that are not found within the
native (non-recombinant) form of the cell. Recombinant cells can also contain genes found
in the native form of the cell wherein the genes are modified and re-introduced into the cell
by artificial means. The term also encompasses cells that contain a nucleic acid endogenous
to the cell that has been modified without removing the nucleic acid from the cell; such
modifications include those obtained by gene replacement, site-specific mutation, and related
techniques. A "recombinant protein" is one which has been produced by a recombinant cell.

[0053] A "fusion protein" refers to an H. pylori fucosyltransferase protein comprising
amino acid sequences that are in addition to, in place of, less than, and/or different from the
amino acid sequences encoding the original or native full-length protein or subsequences
thereof.

[0054] Components of fusion proteins include "accessory enzymes" and/or "purification or
amino acid tags." An "accessory enzyme," as referred to herein, is an enzyme that is involved
in catalyzing a reaction that, for example, forms a substrate for a fucosyltransferase. An
accessory enzyme can, for example, catalyze the formation of a nucleotide sugar that is used
as a donor moiety by a fucosyltransferase, e.g., GDP-fucose. An accessory enzyme can also
be one that is used in the generation of a nucleotide triphosphate required for formation of a
nucleotide sugar, or in the generation of the sugar which is incorporated into the nucleotide
sugar, e.g., fucose. The recombinant fusion protein of the invention can be constructed and
expressed as a fusion protein with a molecular "purification tag" at one end, which facilitates
purification of the protein. Such tags can also be used for immobilization of a protein of
interest during the glycosylation reaction. Suitable tags include "epitope tags," each of which
is a protein sequence that is specifically recognized by an antibody. Epitope tags are generally
incorporated into fusion proteins to enable the use of a readily available antibody to
unambiguously detect or isolate the fusion protein. A "FLAG tag" is a commonly used
epitope tag, specifically recognized by a monoclonal anti-FLAG antibody, consisting of the
sequence AspTyrLysAspAspAspAspLys or a substantially identical variant thereof. Other
suitable tags are known to those of skill in the art, and include, for example, an affinity tag
such as a hexahistidine peptide, which will bind to metal ions such as nickel or cobalt ions.
Purification tags also include maltose binding domains and starch binding domains.
Purification of maltose binding domain proteins is known to those of skill in the art. Starch
binding domains are described in WO 99/15636, herein incorporated by reference. Affinity
purification of a fusion protein comprising a starch binding domain using a beta-cyclodextrin
(BCD)-derivatized resin is described in USSN 60/468,374, filed May 5, 2003, herein
incorporated by reference in its entirety.
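
As an illustrative sketch only, and not part of the disclosed methods, the epitope and affinity tags named above can be located in a one-letter amino acid sequence by a simple string search. The FLAG sequence AspTyrLysAspAspAspAspLys corresponds to DYKDDDDK and a hexahistidine peptide to HHHHHH. The function name `find_tags` and the example fusion sequence are hypothetical.

```python
# One-letter forms of the tags named in the text.
PURIFICATION_TAGS = {
    "FLAG": "DYKDDDDK",  # AspTyrLysAspAspAspAspLys
    "His6": "HHHHHH",    # hexahistidine peptide
}


def find_tags(protein_seq):
    """Return {tag name: 1-based start position} for each tag that is present."""
    seq = protein_seq.upper()
    hits = {}
    for name, motif in PURIFICATION_TAGS.items():
        pos = seq.find(motif)  # -1 when the motif is absent
        if pos != -1:
            hits[name] = pos + 1
    return hits


# Hypothetical fusion: N-terminal FLAG tag, placeholder body, C-terminal His6.
example_fusion = "MDYKDDDDK" + "GSAAALLLKKK" + "HHHHHH"
print(find_tags(example_fusion))
```

An exact search of this kind would miss the "substantially identical variants" of the FLAG sequence that the definition also allows; detecting those would need an alignment-based comparison.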

[0055] The term "functional domain" with reference to glycosyltransferases, refers to a
domain of the glycosyltransferase that confers or modulates an activity of the enzyme, e.g.,
acceptor substrate specificity, catalytic activity, binding affinity, or other biological or
biochemical activity. Examples of functional domains of glycosyltransferases include, but
are not limited to, the catalytic domain.

[0056] The terms "expression level" or "level of expression" with reference to a protein
refer to the amount of a protein produced by a cell. The amount of protein produced by a
cell can be measured by the assays and activity units described herein or known to one skilled
in the art. One skilled in the art would know how to measure and describe the amount of
protein produced by a cell using a variety of assays and units, respectively. Thus, the
quantitation and quantitative description of the level of expression of a protein, e.g., an H.
pylori fucosyltransferase, can be assayed by measuring the enzymatic activity or the units used
to describe the activity, or the amount of protein. The amount of protein produced by a cell
can be determined by standard known assays, for example, the protein assay by Bradford
(1976), the bicinchoninic acid protein assay kit from Pierce (Rockford, Illinois), or as
described in U.S. Patent No. 5,641,668.

[0057] The term "enzymatic activity" refers to an activity of an enzyme and may be
measured by the assays and units described herein or known to one skilled in the art.

[0058] The term "specific activity" as used herein refers to the catalytic activity of an
enzyme, e.g., an H. pylori fucosyltransferase protein of the present invention, and may be
expressed in activity units. As used herein, one activity unit catalyzes the formation of 1
μmol of product per minute at a given temperature (e.g., at 37°C) and pH value (e.g., at pH
7.5). Thus, 10 units of an enzyme is a catalytic amount of that enzyme where 10 μmol of
substrate are converted to 10 μmol of product in one minute at a temperature of, e.g., 37°C
and a pH value of, e.g., 7.5.
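
The unit definition above reduces to simple arithmetic, sketched here for illustration under the stated definition (one unit forms 1 μmol of product per minute at a fixed temperature and pH). The function names are hypothetical and the numbers are examples, not measured data.

```python
def activity_units(product_umol, minutes):
    """Activity in units: micromoles of product formed per minute."""
    if minutes <= 0:
        raise ValueError("minutes must be positive")
    return product_umol / minutes


def product_formed_umol(units, minutes):
    """Micromoles of product expected from a given number of units over time."""
    if minutes < 0:
        raise ValueError("minutes must not be negative")
    return units * minutes


# The example from the text: 10 umol of product in one minute is 10 units.
print(activity_units(10.0, 1.0))
```

The calculation assumes the reaction rate is constant over the measured interval, which in practice holds only while substrate is not limiting.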

[0059] A "catalytic domain" refers to a protein domain, or a subsequence thereof, that
catalyzes an enzymatic reaction performed by the enzyme. For example, a catalytic domain
of a fucosyltransferase will include a subsequence of the fucosyltransferase sufficient to
transfer a fucose residue from a donor to an acceptor saccharide. A catalytic domain can
include an entire enzyme, a subsequence thereof, or can include additional amino acid
sequences that are not attached to the enzyme, or a subsequence thereof, as found in nature.
The α-1,3/4-fucosyltransferase enzymes of the invention can also be recognized by the
presence of highly conserved catalytic domains that are found in a family of
fucosyltransferase proteins, glycosyltransferase family 10, see e.g., gnl|CDD|16836
pfam00852, Glyco_transf_10. Alignments between conserved catalytic domains of 1182
futB, 1111 futA, 1218 futB, and 19C2 futB and a consensus sequence from the catalytic
domain of glycosyltransferase family 10 members are shown in figures 8-11. Highly
conserved regions, similar to a region of the glycosyltransferase family 10 catalytic domain
consensus sequence starting at about amino acid 11 and ending at amino acid 301, are found
in each of the H. pylori α-1,3/4-fucosyltransferase enzymes listed above, e.g., 1182 futB,
amino acids 23-305; 1111 futA, amino acids 27-304; 1218 futB, amino acids 23-305; and
19C2 futB, amino acids 22-277, and are believed to be the catalytic domains of the enzyme.
Thus, polypeptides comprising the above-identified fucosyltransferase catalytic domains can
be used in the methods of the invention, e.g., fucosylating glycoproteins. Nucleic acids that
encode the above-identified fucosyltransferase catalytic domains can also be used in the
methods of the invention, e.g., production of fucosyltransferase proteins for fucosylating
glycoproteins.
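
For illustration, the residue ranges listed above can be applied to a full-length sequence as follows. This is a minimal sketch: the coordinates are those stated in the text and are treated as 1-based and inclusive (an assumption), the function name is hypothetical, and the sequence used in the example is a placeholder rather than a real enzyme sequence.

```python
# Catalytic-domain ranges as listed in the text (assumed 1-based, inclusive).
CATALYTIC_DOMAIN_RANGES = {
    "1182 futB": (23, 305),
    "1111 futA": (27, 304),
    "1218 futB": (23, 305),
    "19C2 futB": (22, 277),
}


def catalytic_domain(enzyme, full_length_seq):
    """Return the subsequence spanning the listed catalytic-domain residues."""
    start, end = CATALYTIC_DOMAIN_RANGES[enzyme]
    if len(full_length_seq) < end:
        raise ValueError("sequence is shorter than the listed domain end")
    return full_length_seq[start - 1:end]  # convert to 0-based slicing


# Placeholder sequence only; real sequences come from the sequence listing.
placeholder = "A" * 400
print(len(catalytic_domain("1182 futB", placeholder)))
```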

[0060] A "subsequence" refers to a sequence of nucleic acids or amino acids that comprises
a part of a longer sequence of nucleic acids or amino acids (e.g., protein), respectively.

[0061] The term "nucleic acid" refers to a deoxyribonucleotide or ribonucleotide polymer
in either single- or double-stranded form, and unless otherwise limited, encompasses known
analogues of natural nucleotides that hybridize to nucleic acids in a manner similar to
naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid
sequence includes the complementary sequence thereof.

[0062] A "recombinant expression cassette" or simply an "expression cassette" is a nucleic
acid construct, generated recombinantly or synthetically, with nucleic acid elements that are
capable of effecting expression of a structural gene in hosts compatible with such sequences.
Expression cassettes include at least promoters and, optionally, transcription termination
signals. Typically, the recombinant expression cassette includes a nucleic acid to be
transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter. Additional
factors necessary or helpful in effecting expression may also be used as described herein. For
example, an expression cassette can also include nucleotide sequences that encode a signal
sequence that directs secretion of an expressed protein from the host cell. Transcription
termination signals, enhancers, and other nucleic acid sequences that influence gene
expression, can also be included in an expression cassette.

[0063] A "heterologous sequence" or a "heterologous nucleic acid", as used herein, is one
that originates from a source foreign to the particular host cell, or, if from the same source, is
modified from its original form. Thus, a heterologous glycoprotein gene in a eukaryotic host
cell includes a glycoprotein-encoding gene that is endogenous to the particular host cell that
has been modified. Modification of the heterologous sequence may occur, e.g., by treating
the DNA with a restriction enzyme to generate a DNA fragment that is capable of being
operably linked to the promoter. Techniques such as site-directed mutagenesis are also useful
for modifying a heterologous sequence.

[0064] The term "isolated" refers to material that is substantially or essentially free from 
components which interfere with the activity of an enzyme. For a saccharide, protein, or 
nucleic acid of the invention, the term "isolated" refers to material that is substantially or 
essentially free from components which normally accompany the material as found in its 
native state. Typically, an isolated saccharide, protein, or nucleic acid of the invention is at 
least about 80% pure, usually at least about 90%, and preferably at least about 95% pure as 
measured by band intensity on a silver stained gel or other method for determining purity.
Purity or homogeneity can be indicated by a number of means well known in the art. For
example, a protein or nucleic acid in a sample can be resolved by polyacrylamide gel
electrophoresis, and then the protein or nucleic acid can be visualized by staining. For certain
purposes high resolution of the protein or nucleic acid may be desirable and HPLC or a
similar means for purification, for example, may be utilized.

[0065] The term "operably linked" refers to functional linkage between a nucleic acid
expression control sequence (such as a promoter, signal sequence, or array of transcription 
factor binding sites) and a second nucleic acid sequence, wherein the expression control 
sequence affects transcription and/or translation of the nucleic acid corresponding to the 
second sequence. 

[0066] The terms "identical" or percent "identity," in the context of two or more nucleic
acids or protein sequences, refer to two or more sequences or subsequences that are the same 
or have a specified percentage of amino acid residues or nucleotides that are the same, when 
compared and aligned for maximum correspondence, as measured using one of the following 
sequence comparison algorithms or by visual inspection. 

[0067] The phrase "substantially identical," in the context of two nucleic acids or proteins,
refers to two or more sequences or subsequences that have greater than about 60% nucleic
acid or amino acid sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%,
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% nucleotide or amino acid residue identity,
when compared and aligned for maximum correspondence, as measured using one of the
following sequence comparison algorithms or by visual inspection. Preferably, the
substantial identity exists over a region of the sequences that is at least about 50 residues in
length, more preferably over a region of at least about 100 residues, and most preferably the
sequences are substantially identical over at least about 150 residues. In a most preferred
embodiment, the sequences are substantially identical over the entire length of the coding
regions.
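
The percentage and region-length criteria above can be illustrated with a short calculation on two sequences that have already been aligned. This is a sketch under stated assumptions: gaps are written as "-", identity is computed over all aligned columns (other conventions for the denominator exist), and the function names and default thresholds (greater than 60% over at least 50 residues) are chosen here to mirror the broadest values in the text.

```python
def aligned_columns(aligned_a, aligned_b, gap="-"):
    """Pair up alignment columns, dropping columns where both rows are gaps."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    return [(a, b) for a, b in pairs if not (a == gap and b == gap)]


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of aligned columns holding the same residue in both rows."""
    cols = aligned_columns(aligned_a, aligned_b, gap)
    if not cols:
        return 0.0
    same = sum(1 for a, b in cols if a == b)
    return 100.0 * same / len(cols)


def substantially_identical(aligned_a, aligned_b, min_identity=60.0, min_region=50):
    """Apply an identity threshold over a minimum aligned region length."""
    cols = aligned_columns(aligned_a, aligned_b)
    identity = percent_identity(aligned_a, aligned_b)
    return len(cols) >= min_region and identity > min_identity


print(percent_identity("ACGT", "ACGA"))
```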

[0068] For sequence comparison, typically one sequence acts as a reference sequence, to 
which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are input into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated. The sequence
comparison algorithm then calculates the percent sequence identity for the test sequence(s)
relative to the reference sequence, based on the designated program parameters.

[0069] Optimal alignment of sequences for comparison can be conducted, e.g., by the local
homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology
alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by the search for
similarity method of Pearson & Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA
in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr.,
Madison, WI), or by visual inspection (see generally, Current Protocols in Molecular
Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between Greene
Publishing Associates, Inc. and John Wiley & Sons, Inc., (1995 Supplement) (Ausubel)).
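
To make the local homology approach concrete, the following is a minimal sketch of Smith-Waterman scoring with a linear gap penalty. It is not the implementation found in the packages named above; the scoring values (match 2, mismatch -1, gap -2) are illustrative assumptions, and only the best local score is returned rather than the alignment itself.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)  # previous row of the scoring matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("TTACGTTT", "GGACGTGG"))
```

Keeping only two rows of the matrix is enough when just the score is needed; recovering the aligned residues would require storing the full matrix and tracing back from the best cell.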

[0070] Examples of algorithms that are suitable for determining percent sequence identity
and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in
Altschul et al. (1990) J. Mol. Biol. 215: 403-410 and Altschul et al. (1997) Nucleic Acids
Res. 25: 3389-3402, respectively. Software for performing BLAST analyses is publicly
available through the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring
sequence pairs (HSPs) by identifying short words of length W in the query sequence, which
either match or satisfy some positive-valued threshold score T when aligned with a word of
the same length in a database sequence. T is referred to as the neighborhood word score
threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for
initiating searches to find longer HSPs containing them. The word hits are then extended in
both directions along each sequence for as far as the cumulative alignment score can be
increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters
M (reward score for a pair of matching residues; always > 0) and N (penalty score for
mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to
calculate the cumulative score. Extension of the word hits in each direction is halted when:
the cumulative alignment score falls off by the quantity X from its maximum achieved value;
the cumulative score goes to zero or below, due to the accumulation of one or more
negative-scoring residue alignments; or the end of either sequence is reached. The BLAST
algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The
BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an
expectation (E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid
sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E)
of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci.
USA 89:10915 (1992)).
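
The extension step described above can be sketched for nucleotide sequences as follows. This is a simplified illustration and not the BLAST source code: it extends an exact-match seed in one direction only, without gaps, using the M and N values from the text and an arbitrary X chosen for the example. The function and parameter names are hypothetical.

```python
def extend_right(query, subject, q_start, s_start, seed_len, M=5, N=-4, X=20):
    """Extend an exact seed to the right; return (best score, residues added).

    Extension halts when the score falls X below its maximum, when the
    score reaches zero or below, or when either sequence ends.
    """
    score = seed_len * M  # assumption: the seed is an exact match
    best, best_len = score, 0
    q, s = q_start + seed_len, s_start + seed_len
    offset = 0
    while q + offset < len(query) and s + offset < len(subject):
        score += M if query[q + offset] == subject[s + offset] else N
        if score > best:
            best, best_len = score, offset + 1
        if best - score >= X or score <= 0:
            break
        offset += 1
    return best, best_len


print(extend_right("ACGTACGTAAAA", "ACGTACGTCCCC", 0, 0, 4, X=8))
```

A complete search would apply the same procedure leftward from the seed and combine the two results into one high scoring sequence pair.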

[0071] In addition to calculating percent sequence identity, the BLAST algorithm also
performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin &
Altschul, Proc. Nat'l. Acad. Sci. USA 90:5873-5877 (1993)). One measure of similarity
provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an
indication of the probability by which a match between two nucleotide or amino acid
sequences would occur by chance. For example, a nucleic acid is considered similar to a
reference sequence if the smallest sum probability in a comparison of the test nucleic acid to
the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and
most preferably less than about 0.001.

[0072] A further indication that two nucleic acid sequences or proteins are substantially
identical is that the protein encoded by the first nucleic acid is immunologically cross reactive
with the protein encoded by the second nucleic acid, as described below. Thus, a protein is
typically substantially identical to a second protein, for example, where the two peptides
differ only by conservative substitutions. Another indication that two nucleic acid sequences
are substantially identical is that the two molecules hybridize to each other under stringent
conditions, as described below.

[0073] The phrase "hybridizing specifically to" refers to the binding, duplexing, or
hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions
when that sequence is present in a complex mixture (e.g., total cellular DNA or RNA).

[0074] The term "stringent conditions" refers to conditions under which a probe will
hybridize to its target subsequence, but to no other sequences. Stringent conditions are
sequence-dependent and will be different in different circumstances. Longer sequences
hybridize specifically at higher temperatures. Generally, stringent conditions are selected to
be about 15°C lower than the thermal melting point (Tm) for the specific sequence at a
defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH,
and nucleic acid concentration) at which 50% of the probes complementary to the target
sequence hybridize to the target sequence at equilibrium. (As the target sequences are
generally present in excess, at Tm, 50% of the probes are occupied at equilibrium.)
Typically, stringent conditions will be those in which the salt concentration is less than about
1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to
8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and
at least about 60°C for long probes (e.g., greater than 50 nucleotides). Stringent conditions
may also be achieved with the addition of destabilizing agents such as formamide. For
selective or specific hybridization, a positive signal is typically at least two times
background, preferably 10 times background hybridization. Exemplary stringent
hybridization conditions can be as follows: 50% formamide, 5x SSC, and 1% SDS,
incubating at 42°C, or 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency
amplification, although annealing temperatures may vary between about 32-48°C depending
on primer length. For high stringency PCR amplification, a temperature of about 62°C is
typical, although high stringency annealing temperatures can range from about 50°C to about
65°C, depending on the primer length and specificity. Typical cycle conditions for both high
and low stringency amplifications include a denaturation phase of 90-95°C for 30-120 sec,
an annealing phase lasting 30-120 sec, and an extension phase of about 72°C for 1-2 min.
Protocols and guidelines for low and high stringency amplification reactions are available,
e.g., in Innis, et al. (1990) PCR Protocols: A Guide to Methods and Applications, Academic
Press, N.Y.
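
The PCR temperature ranges and cycle phases given above can be encoded as a small lookup, shown here purely as an illustration. The ranges and typical values are the approximate figures stated in the text; the function names, the "unspecified" label for temperatures outside both ranges, and the dictionary layout are assumptions made for this sketch.

```python
# Approximate annealing ranges from the text, in degrees Celsius.
LOW_STRINGENCY_RANGE_C = (32, 48)
HIGH_STRINGENCY_RANGE_C = (50, 65)


def annealing_stringency(temp_c):
    """Classify an annealing temperature against the ranges given in the text."""
    if HIGH_STRINGENCY_RANGE_C[0] <= temp_c <= HIGH_STRINGENCY_RANGE_C[1]:
        return "high"
    if LOW_STRINGENCY_RANGE_C[0] <= temp_c <= LOW_STRINGENCY_RANGE_C[1]:
        return "low"
    return "unspecified"  # the text gives no classification for this value


def typical_cycle(stringency):
    """Return the typical cycle phases as (min, max) temperature and time ranges."""
    anneal_c = {"low": 36, "high": 62}[stringency]
    return [
        {"step": "denaturation", "temp_c": (90, 95), "seconds": (30, 120)},
        {"step": "annealing", "temp_c": (anneal_c, anneal_c), "seconds": (30, 120)},
        {"step": "extension", "temp_c": (72, 72), "seconds": (60, 120)},
    ]


print(annealing_stringency(62))
```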

[0075] The phrases "specifically binds to a protein" or "specifically immunoreactive with",
when referring to an antibody, refer to a binding reaction which is determinative of the
presence of the protein in the presence of a heterogeneous population of proteins and other
biologics. Thus, under designated immunoassay conditions, the specified antibodies bind
preferentially to a particular protein and do not bind in a significant amount to other proteins
present in the sample. Specific binding to a protein under such conditions requires an
antibody that is selected for its specificity for a particular protein. A variety of immunoassay
formats may be used to select antibodies specifically immunoreactive with a particular
protein. For example, solid-phase ELISA immunoassays are routinely used to select
monoclonal antibodies specifically immunoreactive with a protein. See Harlow and Lane
(1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York, for a
description of immunoassay formats and conditions that can be used to determine specific
immunoreactivity.

[0076] "Conservatively modified variations" of a particular polynucleotide sequence refers
to those polynucleotides that encode identical or essentially identical amino acid sequences,
or where the polynucleotide does not encode an amino acid sequence, to essentially identical
sequences. Because of the degeneracy of the genetic code, a large number of functionally
identical nucleic acids encode any given protein. For instance, the codons CGU, CGC, CGA,
CGG, AGA, and AGG all encode the amino acid arginine. Thus, at every position where an
arginine is specified by a codon, the codon can be altered to any of the corresponding codons
described without altering the encoded protein. Such nucleic acid variations are "silent
variations," which are one species of "conservatively modified variations." Every
polynucleotide sequence described herein which encodes a protein also describes every
possible silent variation, except where otherwise noted. One of skill will recognize that each
codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and
UGG, which is ordinarily the only codon for tryptophan) can be modified to yield a
functionally identical molecule by standard techniques. Accordingly, each "silent variation"
of a nucleic acid which encodes a protein is implicit in each described sequence.
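
The degeneracy described above can be checked directly against the standard genetic code. The following sketch builds the standard codon table and lists the synonymous codons for any codon; it also counts how many distinct RNA sequences encode the same protein as a given coding sequence. The function names are hypothetical, and organism-specific alternative codes are not considered.

```python
from itertools import product

# Standard genetic code, codons enumerated in U, C, A, G order ("*" = stop).
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(codon):
    """All codons encoding the same amino acid (or stop) as the given codon."""
    target = CODON_TABLE[codon.upper()]
    return sorted(c for c, aa in CODON_TABLE.items() if aa == target)


def count_silent_variants(rna):
    """Number of RNA sequences (including the input) encoding the same protein."""
    rna = rna.upper()
    if len(rna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    total = 1
    for i in range(0, len(rna), 3):
        total *= len(synonymous_codons(rna[i:i + 3]))
    return total


print(synonymous_codons("CGU"))
```

Because the count is a product over codons, it grows very quickly with sequence length, which is why the text treats silent variations as implicit rather than listing them.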

[0077] Furthermore, one of skill will recognize that individual substitutions, deletions or
additions which alter, add or delete a single amino acid or a small percentage of amino acids
(typically less than 5%, more typically less than 1%) in an encoded sequence are
"conservatively modified variations" where the alterations result in the substitution of an
amino acid with a chemically similar amino acid. Conservative substitution tables providing
functionally similar amino acids are well known in the art.

[0078] One of skill will appreciate that many conservative variations of the fusion proteins
and nucleic acid which encode the fusion proteins yield essentially identical products. For
example, due to the degeneracy of the genetic code, "silent substitutions" (i.e., substitutions
of a nucleic acid sequence which do not result in an alteration in an encoded protein) are an
implied feature of every nucleic acid sequence which encodes an amino acid. As described
herein, sequences are preferably optimized for expression in a particular host cell used to
produce the chimeric glycosyltransferases (e.g., yeast, human, and the like). Similarly,
"conservative amino acid substitutions," in which one or a few amino acids in an amino acid
sequence are substituted with different amino acids with highly similar properties (see, the
definitions section, supra), are also readily identified as being highly similar to a particular
amino acid sequence, or to a particular nucleic acid sequence which encodes an amino acid.
Such conservatively substituted variations of any particular sequence are a feature of the
present invention. See also, Creighton (1984) Proteins, W.H. Freeman and Company. In
addition, individual substitutions, deletions or additions which alter, add or delete a single
amino acid or a small percentage of amino acids in an encoded sequence are also
"conservatively modified variations".

WO 2004/009793 PCT/US2003/023155

[0079] The practice of this invention can involve the construction of recombinant nucleic
acids and the expression of genes in transfected host cells. Molecular cloning techniques to
achieve these ends are known in the art. A wide variety of cloning and in vitro amplification
methods suitable for the construction of recombinant nucleic acids such as expression vectors
are well known to persons of skill. Examples of these techniques and instructions sufficient
to direct persons of skill through many cloning exercises are found in Berger and Kimmel,
Guide to Molecular Cloning Techniques, Methods in Enzymology volume 152 Academic
Press, Inc., San Diego, CA (Berger); and Current Protocols in Molecular Biology, F.M.
Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing Associates,
Inc. and John Wiley & Sons, Inc., (1999 Supplement) (Ausubel). Suitable host cells for
expression of the recombinant H. pylori fucosyltransferases are known to those of skill in the
art, and include, for example, bacterial cells, including E. coli. Eukaryotic cells can also be
used in the present invention, for example insect cells such as Sf9 cells and yeast or fungal
cells (e.g., Aspergillus niger or yeast).

[0080] Examples of protocols sufficient to direct persons of skill through in vitro
amplification methods, including the polymerase chain reaction (PCR), the ligase chain
reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated techniques
are found in Berger, Sambrook, and Ausubel, as well as Mullis et al. (1987) U.S. Patent No.
4,683,202; PCR Protocols: A Guide to Methods and Applications (Innis et al. eds.) Academic
Press Inc. San Diego, CA (1990) (Innis); Arnheim & Levinson (October 1, 1990) C&EN 36-
47; The Journal Of NIH Research (1991) 3: 81-94; Kwoh et al. (1989) Proc. Natl. Acad. Sci.
USA 86: 1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87: 1874; Lomell et al.
(1989) J. Clin. Chem. 35: 1826; Landegren et al. (1988) Science 241: 1077-1080; Van Brunt
(1990) Biotechnology 8: 291-294; Wu and Wallace (1989) Gene 4: 560; and Barringer et al.
(1990) Gene 89: 117. Improved methods of cloning in vitro amplified nucleic acids are
described in Wallace et al., U.S. Pat. No. 5,426,039.

DETAILED DESCRIPTION OF THE INVENTION 
[0081] The present invention provides for the first time bacterial
α-1,3/4-fucosyltransferases, i.e., H. pylori fucosyltransferases, that transfer fucose from a donor
substrate to an acceptor sugar on a glycoprotein. In addition, the fucosyltransferases are
useful for producing fucosylated oligosaccharides and glycolipids.

[0082] Specifically, α-1,3/4-fucosyltransferases from the following H. pylori strains were
cloned and analyzed: 915A2, 1111A2, 19C2B1, 1182B3, 19C2A5, 26695, and 1218.
Fucosyltransferases from the following H. pylori strains transferred fucose to GlcNAc:
915A2, 1111A2, 19C2FutB, and 1182B3. The FutA gene product from H. pylori strain
19C2A5 transferred fucose to the reducing glucose of the LNnT acceptor, as did the FutB
gene product from H. pylori strain 1218. The ability of FutA gene product from H. pylori
strain 26695 to transfer fucose to glucose was confirmed.

[0083] A major advantage of the H. pylori α-1,3/4-fucosyltransferases over mammalian
α-1,3/4-fucosyltransferases is that the H. pylori enzyme appears to be unaffected by the
sialylation status of the acceptor. In addition, some of the H. pylori fucosyltransferases add
fucose exclusively to the N-acetylglucosamine (GlcNAc) residue in acceptor sugars that
contain both glucose and GlcNAc residues. In contrast, mammalian
α-1,3/4-fucosyltransferases are sensitive to the degree of sialylation of the acceptor and some
mammalian enzymes add to both glucose and GlcNAc residues in the same acceptor. In
addition, bacterially expressed enzymes offer a large cost savings relative to the expression of
mammalian gene products in Sf9 or CHO systems.



A. Cloning of H. pylori α-1,3/4-fucosyltransferase proteins

[0084] Nucleic acids that encode glycosyltransferases, e.g., H. pylori
α-1,3/4-fucosyltransferases, and methods of obtaining such nucleic acids, are known to those of
skill in the art. Suitable nucleic acids (e.g., cDNA, genomic, or subsequences (probes)) can be
cloned, or amplified by in vitro methods such as the polymerase chain reaction (PCR), the
ligase chain reaction (LCR), the transcription-based amplification system (TAS), or the
self-sustained sequence replication system (SSR). A wide variety of cloning and in vitro
amplification methodologies are well-known to persons of skill. Examples of these
techniques and instructions sufficient to direct persons of skill through many cloning
exercises are found in Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods
in Enzymology 152 Academic Press, Inc., San Diego, CA (Berger); Sambrook et al. (1989)
Molecular Cloning: A Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor
Laboratory, Cold Spring Harbor Press, NY, (Sambrook et al.); Current Protocols in
Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1994 Supplement)
(Ausubel); Cashion et al., U.S. Patent No. 5,017,478; and Carr, European Patent No.
0,246,864.

[0085] A DNA that encodes an H. pylori α-1,3/4-fucosyltransferase, or a subsequence
thereof, can be prepared by any suitable method described above, including, for example,
cloning and restriction of appropriate sequences with restriction enzymes. In one preferred
embodiment, nucleic acids encoding H. pylori α-1,3/4-fucosyltransferases are isolated by
routine cloning methods. A nucleotide sequence of an H. pylori α-1,3/4-fucosyltransferase as
provided in, for example, GenBank or other sequence database (see above) can be used to
provide probes that specifically hybridize to an H. pylori α-1,3/4-fucosyltransferase gene in a
genomic DNA sample, or to an mRNA, encoding an H. pylori α-1,3/4-fucosyltransferase, in a
total RNA sample (e.g., in a Southern or Northern blot). Once the target nucleic acid
encoding an H. pylori α-1,3/4-fucosyltransferase is identified, it can be isolated according to
standard methods known to those of skill in the art (see, e.g., Sambrook et al. (1989)
Molecular Cloning: A Laboratory Manual 2nd Ed., Vols. 1-3, Cold Spring Harbor
Laboratory; Berger and Kimmel (1987) Methods in Enzymology, Vol. 152: Guide to
Molecular Cloning Techniques, San Diego: Academic Press, Inc.; or Ausubel et al. (1987)
Current Protocols in Molecular Biology, Greene Publishing and Wiley-Interscience, New
York). Further, the isolated nucleic acids can be cleaved with restriction enzymes to create
nucleic acids encoding the full-length H. pylori α-1,3/4-fucosyltransferase, or subsequences
thereof, e.g., containing subsequences encoding at least a subsequence of a catalytic domain
of an H. pylori α-1,3/4-fucosyltransferase. These restriction enzyme fragments, encoding an
H. pylori α-1,3/4-fucosyltransferase or subsequences thereof, may then be ligated, for
example, to produce a nucleic acid encoding an H. pylori α-1,3/4-fucosyltransferase protein.

[0086] A nucleic acid encoding an H. pylori α-1,3/4-fucosyltransferase, or a subsequence
thereof, can be characterized by assaying for the expressed product. Assays based on the
detection of the physical, chemical, or immunological properties of the expressed protein can
be used. For example, one can identify a cloned H. pylori α-1,3/4-fucosyltransferase by the
ability of a protein encoded by the nucleic acid to catalyze the transfer of a fucose residue
from a donor substrate to an acceptor substrate. In one method, capillary electrophoresis is
employed to detect the reaction products. This highly sensitive assay involves using either
saccharide or disaccharide aminophenyl derivatives which are labeled with fluorescein as
described in Wakarchuk et al. (1996) J. Biol. Chem. 271 (45): 28271-276. For example, to
assay for a Neisseria lgtC enzyme, either FCHASE-AP-Lac or FCHASE-AP-Gal can be
used, whereas for the Neisseria lgtB enzyme an appropriate reagent is FCHASE-AP-GlcNAc
(Id.). Other methods for detection of a fucosylated reaction product include thin layer
chromatography and GC/MS.

[0087] Also, a nucleic acid encoding an H. pylori α-1,3/4-fucosyltransferase, or a
subsequence thereof, can be chemically synthesized. Suitable methods include the
phosphotriester method of Narang et al. (1979) Meth. Enzymol. 68: 90-99; the phosphodiester
method of Brown et al. (1979) Meth. Enzymol. 68: 109-151; the diethylphosphoramidite
method of Beaucage et al. (1981) Tetra. Lett., 22: 1859-1862; and the solid support method
of U.S. Patent No. 4,458,066. Chemical synthesis produces a single stranded
oligonucleotide. This can be converted into double stranded DNA by hybridization with a
complementary sequence, or by polymerization with a DNA polymerase using the single
strand as a template. One of skill recognizes that while chemical synthesis of DNA is often
limited to sequences of about 100 bases, longer sequences may be obtained by the ligation of
shorter sequences.

[0088] Nucleic acids encoding H. pylori α-1,3/4-fucosyltransferases, or subsequences
thereof, can be cloned using DNA amplification methods such as polymerase chain reaction
(PCR). Thus, for example, the nucleic acid sequence or subsequence is PCR amplified, using
a sense primer containing one restriction enzyme site (e.g., NdeI) and an antisense primer
containing another restriction enzyme site (e.g., HindIII). This will produce a nucleic acid
encoding the desired H. pylori α-1,3/4-fucosyltransferase or subsequence and having terminal
restriction enzyme sites. This nucleic acid can then be easily ligated into a vector containing
a nucleic acid encoding the second molecule and having the appropriate corresponding
restriction enzyme sites. Suitable PCR primers can be determined by one of skill in the art
using the sequence information provided in GenBank or other sources. Appropriate
restriction enzyme sites can also be added to the nucleic acid encoding the H. pylori
α-1,3/4-fucosyltransferase protein or protein subsequence by site-directed mutagenesis. The
plasmid containing the H. pylori α-1,3/4-fucosyltransferase-encoding nucleotide sequence or
subsequence is cleaved with the appropriate restriction endonuclease and then ligated into an
appropriate vector for amplification and/or expression according to standard methods.
Examples of techniques sufficient to direct persons of skill through in vitro amplification
methods are found in Berger, Sambrook, and Ausubel, as well as Mullis et al. (1987) U.S.
Patent No. 4,683,202; PCR Protocols: A Guide to Methods and Applications (Innis et al., eds.)
Academic Press Inc., San Diego, CA (1990) (Innis); Arnheim & Levinson (October 1, 1990)
C&EN 36-47; The Journal of NIH Research (1991) 3: 81-94; Kwoh et al. (1989) Proc.
Natl. Acad. Sci. USA 86: 1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87: 1874;
Lomell et al. (1989) J. Clin. Chem. 35: 1826; Landegren et al. (1988) Science 241:
1077-1080; Van Brunt (1990) Biotechnology 8: 291-294; Wu and Wallace (1989) Gene 4: 560;
and Barringer et al. (1990) Gene 89: 117.
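
The primer strategy of paragraph [0088] (a sense primer carrying an NdeI site and an antisense primer carrying a HindIII site) can be sketched in Python as follows. This is a hedged illustration, not a validated primer-design tool: the four-base 5' clamp, the 18-base gene-specific region, the function names, and the example coding sequence are all assumptions introduced here. Only the recognition sequences of NdeI (CATATG) and HindIII (AAGCTT) are established facts.

```python
# Illustrative sketch only; clamp, binding length, and example CDS are assumptions.
NDEI = "CATATG"      # NdeI recognition site
HINDIII = "AAGCTT"   # HindIII recognition site
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the complementary strand, written 5' to 3'."""
    return "".join(COMPLEMENT[b] for b in reversed(seq.upper()))


def design_primers(cds, bind_len=18, clamp="GCGC"):
    """Return (sense, antisense) primers carrying NdeI and HindIII sites.

    cds is the coding strand, 5' to 3'. Each primer is laid out as
    5' clamp + restriction site + gene-specific region 3'.
    """
    if len(cds) < bind_len:
        raise ValueError("sequence shorter than the primer binding region")
    if NDEI in cds or HINDIII in cds:
        raise ValueError("internal NdeI or HindIII site; choose other enzymes")
    sense = clamp + NDEI + cds[:bind_len]
    antisense = clamp + HINDIII + reverse_complement(cds[-bind_len:])
    return sense, antisense


# Hypothetical coding sequence; not an H. pylori fucosyltransferase sequence.
cds = "ATGGCTAAAGGTCCGTTAGACCATCGTAAAGCTGGTTAA"
sense, antisense = design_primers(cds)
```

In an actual design, the ATG within the NdeI site is commonly made to coincide with the start codon, and melting temperatures of the two primers are matched; neither refinement is modeled here.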

[0089] Other physical properties of a cloned H. pylori α-1,3/4-fucosyltransferase protein
expressed from a particular nucleic acid can be compared to properties of known H. pylori
α-1,3/4-fucosyltransferases to provide another method of identifying suitable sequences or
domains of the H. pylori α-1,3/4-fucosyltransferases that are determinants of acceptor
substrate specificity and/or catalytic activity. Alternatively, a putative H. pylori
α-1,3/4-fucosyltransferase gene or recombinant H. pylori α-1,3/4-fucosyltransferase gene can
be mutated, and its role as an α-1,3/4-fucosyltransferase, or the role of particular sequences
or domains, established by detecting a variation in the structure of a carbohydrate normally
produced by the unmutated, naturally-occurring, or control α-1,3/4-fucosyltransferase.

[0090] Functional domains of cloned H. pylori α-1,3/4-fucosyltransferases can be identified
by using standard methods for mutating or modifying the H. pylori
α-1,3/4-fucosyltransferases and testing the modified or mutated proteins for activities such
as acceptor substrate activity and/or catalytic activity, as described herein. The functional
domains of the various H. pylori α-1,3/4-fucosyltransferases can be used to construct nucleic
acids encoding α-1,3/4-fucosyltransferase proteins comprising the functional domains of one
or more α-1,3/4-fucosyltransferases. These fusion proteins can then be tested for the desired
acceptor substrate or catalytic activity.

[0091] In an exemplary approach to cloning nucleic acids encoding
α-1,3/4-fucosyltransferase proteins, the known nucleic acid or amino acid sequences of cloned
glycosyltransferases are aligned and compared to determine the amount of sequence identity
between various glycosyltransferases. This information can be used to identify and select
protein domains that confer or modulate glycosyltransferase activities, e.g., acceptor substrate
activity and/or catalytic activity, based on the amount of sequence identity between the
glycosyltransferases of interest. For example, domains having sequence identity between the
fucosyltransferases of interest, and that are associated with a known activity, can be used to
construct fucosyltransferase proteins containing that domain, and having the activity
associated with that domain (e.g., acceptor substrate specificity and/or catalytic activity).
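
The "amount of sequence identity" referred to above can be computed in more than one way. As a minimal sketch, assuming the two sequences have already been aligned by some external tool and that gaps are written as "-", one common convention counts identical residues over the aligned, ungapped columns. The function name and the choice of denominator are assumptions made for this illustration.

```python
# Illustrative sketch only; one of several possible definitions of percent identity.
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip columns containing a gap
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared
```

For example, `percent_identity("ACDE", "ACDF")` compares four columns, three of which match. Other conventions divide by the length of the shorter sequence or by the full alignment length, and will give different values for gapped alignments.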

B. Fusion protein comprising accessory enzymes involved in nucleotide
sugar formation

[0092] In some embodiments, the fusion polypeptides of the invention include, in addition
to the α-1,3/4-fucosyltransferase catalytic domain(s) and/or other functional domains, at least
one catalytic domain from an accessory enzyme. Accessory enzymes include, for example,
those enzymes that are involved in the formation of a nucleotide sugar. The accessory
enzyme can be involved in attaching the sugar to a nucleotide, or can be involved in making
the sugar or the nucleotide, for example. The nucleotide sugar is generally one that is utilized
as a saccharide donor by the glycosyltransferase catalytic domain of the particular fusion
polypeptide. α-1,3/4-fucosyltransferases utilize GDP-fucose as a sugar donor. Examples of
fusion proteins comprising a functional domain from a glycosyltransferase and an accessory
enzyme, and methods to make such fusions, are found, for example, in PCT/CA98/01180 and
USSN 09/211,691, filed December 14, 1998, both of which are herein incorporated by
reference for all purposes.

[0093] Accessory enzymes that are involved in synthesis of nucleotide sugars are well
known to those of skill in the art. For a review of bacterial polysaccharide synthesis and gene
nomenclature, see, e.g., Reeves et al., Trends Microbiol. 4: 495-503 (1996). The methods
described above for obtaining glycosyltransferase-encoding nucleic acids are also applicable
to obtaining nucleic acids that encode enzymes involved in the formation of nucleotide
sugars. For example, one can use one of the nucleic acids known in the art, some of which are
listed below, directly or as a probe to isolate a corresponding nucleic acid from other
organisms of interest.

[0094] An example of a fusion polypeptide provided by the invention is used for producing
a fucosylated soluble oligosaccharide. The donor nucleotide sugar for fucosyltransferases is
GDP-fucose, which is relatively expensive to produce. To reduce the cost of producing the
fucosylated oligosaccharide, the invention provides fusion polypeptides that can convert the
relatively inexpensive GDP-mannose into GDP-fucose, and then catalyze the transfer of the
fucose to an acceptor saccharide. These fusion polypeptides include a catalytic domain from
at least one of a GDP-mannose dehydratase, a GDP-4-keto-6-deoxy-D-mannose
3,5-epimerase, or a GDP-4-keto-6-deoxy-L-glucose 4-reductase. When each of these enzyme
activities is provided, one can convert GDP-mannose into GDP-fucose.

[0095] The nucleotide sequence of an E. coli gene cluster that encodes
GDP-fucose-synthesizing enzymes is described by Stevenson et al. (1996) J. Bacteriol. 178:
4885-4893 (GenBank Accession No. U38473). This gene cluster had been reported to include an
open reading frame for GDP-mannose dehydratase (nucleotides 8633-9754; Stevenson et al.,
supra). It was recently discovered that this gene cluster also contains an open reading frame
that encodes an enzyme that has both 3,5-epimerization and 4-reductase activities (see
commonly assigned US Patent No. 6,500,661, issued December 31, 2002), and thus is
capable of converting the product of the GDP-mannose dehydratase reaction
(GDP-4-keto-6-deoxymannose) to GDP-fucose. This ORF, which is designated YEF B, is found
between nucleotides 9757-10722. Prior to this discovery that YEF B encodes an enzyme having
two activities, it was not known whether one or two enzymes were required for conversion of
GDP-4-keto-6-deoxymannose to GDP-fucose. The nucleotide sequence of a gene encoding
the human Fx enzyme is found in GenBank Accession No. U58766.
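
As a quick arithmetic check on the coordinates quoted above, the following Python sketch computes the length of each open reading frame and the number of codons it spans. It assumes, for illustration only, that the nucleotide ranges are 1-based and inclusive and that each range includes the stop codon; neither point is stated in the text, and the function name is hypothetical.

```python
# Illustrative sketch only; coordinate conventions are assumptions (see lead-in).
ORFS = {
    "GDP-mannose dehydratase": (8633, 9754),
    "YEF B": (9757, 10722),
}


def orf_summary(start, end):
    """Return (length in nucleotides, number of codons) for an inclusive range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length, length // 3


summaries = {name: orf_summary(*coords) for name, coords in ORFS.items()}
for name, (length, codons) in summaries.items():
    print(f"{name}: {length} nt, {codons} codons")
```

Under these assumptions both ranges are whole multiples of three nucleotides, which is consistent with their being complete reading frames.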

[0096] Also provided are fusion polypeptides that include a mannosyltransferase catalytic
domain and a catalytic domain of a GDP-Man pyrophosphorylase (EC 2.7.7.22), which
converts Man-1-P to GDP-Man. Suitable genes are known from many organisms, including
E. coli: GenBank U13629, AB010294, D43637, D13231, Bastin et al., Gene 164: 17-23
(1995), Sugiyama et al., J. Bacteriol. 180: 2775-2778 (1998), Sugiyama et al., Microbiology
140 (Pt 1): 59-71 (1994), Kido et al., J. Bacteriol. 177: 2178-2187 (1995); Klebsiella
pneumoniae: GenBank AB010296, AB010295, Sugiyama et al., J. Bacteriol. 180: 2775-2778
(1998); Salmonella enterica: GenBank X56793, M29713, Stevenson et al., J. Bacteriol. 178:
4885-4893 (1996).

[0097] The fusion polypeptides of the invention for fucosylating a saccharide acceptor can
also utilize enzymes that provide a minor or "scavenge" pathway for GDP-fucose formation.
In this pathway, free fucose is phosphorylated by fucokinase to form fucose 1-phosphate,
which, along with guanosine 5'-triphosphate (GTP), is used by GDP-fucose
pyrophosphorylase to form GDP-fucose (Ginsburg et al., J. Biol. Chem. 236: 2389-2393
(1961) and Reitman, J. Biol. Chem. 255: 9900-9906 (1980)). Accordingly, a
fucosyltransferase catalytic domain can be linked to a catalytic domain from a GDP-fucose
pyrophosphorylase, for which suitable nucleic acids are described in copending, commonly
assigned US Patent Application Ser. No. 08/826,964, filed April 9, 1997.
Fucokinase-encoding nucleic acids are described for, e.g., Haemophilus influenzae
(Fleischmann et al. (1995) Science 269: 496-512) and E. coli (Lu and Lin (1989) Nucleic
Acids Res. 17: 4883-4884).

[0098] Additional accessory enzymes from which one can obtain a catalytic domain are
those that are involved in forming reactants consumed in a glycosyltransferase cycle. For
example, any of several phosphate kinases are useful as accessory enzymes. Polyphosphate
kinase (EC 2.7.4.1), for example, catalyzes the formation of ATP, and nucleoside phosphate
kinases (EC 2.7.4.4) can form the respective nucleoside diphosphates. Other suitable kinases
include creatine phosphate kinase (EC 2.7.3.2); myokinase (EC 2.7.4.3); N-acetylglucosamine
acetyl kinase (EC 2.7.1.59); acetyl phosphate kinase; and pyruvate kinase (EC 2.7.1.40).

C. Expression cassettes and host cells for expressing recombinant H. pylori
α-1,3/4-fucosyltransferase proteins

[0099] Fusion proteins of the invention can be expressed in a variety of host cells,
including E. coli, other bacterial hosts, and yeast. The host cells are preferably
microorganisms, such as, for example, yeast cells, bacterial cells, or filamentous fungal cells.
Examples of suitable host cells include, for example, Azotobacter sp. (e.g., A. vinelandii),
Pseudomonas sp., Rhizobium sp., Erwinia sp., Escherichia sp. (e.g., E. coli), Bacillus,
Pseudomonas, Proteus, Salmonella, Serratia, Shigella, Rhizobia, Vitreoscilla, Paracoccus
and Klebsiella sp., among many others. The cells can be of any of several genera, including
Saccharomyces (e.g., S. cerevisiae), Candida (e.g., C. utilis, C. parapsilosis, C. krusei, C.
versatilis, C. lipolytica, C. zeylanoides, C. guilliermondii, C. albicans, and C. humicola),
Pichia (e.g., P. farinosa and P. ohmeri), Torulopsis (e.g., T. candida, T. sphaerica, T. xylinus,
T. famata, and T. versatilis), Debaryomyces (e.g., D. subglobosus, D. cantarellii, D.
globosus, D. hansenii, and D. japonicus), Zygosaccharomyces (e.g., Z. rouxii and Z. bailii),
Kluyveromyces (e.g., K. marxianus), Hansenula (e.g., H. anomala and H. jadinii), and
Brettanomyces (e.g., B. lambicus and B. anomalus). Examples of useful bacteria include, but
are not limited to, Escherichia, Enterobacter, Azotobacter, Erwinia, and Klebsiella.

[0100] Typically, the polynucleotide that encodes the α-1,3/4-fucosyltransferase protein is
placed under the control of a promoter that is functional in the desired host cell. An
extremely wide variety of promoters are well known, and can be used in the expression
vectors of the invention, depending on the particular application. Ordinarily, the promoter
selected depends upon the cell in which the promoter is to be active. Other expression
control sequences such as ribosome binding sites, transcription termination sites and the like
are also optionally included. Constructs that include one or more of these control sequences
are termed "expression cassettes." Accordingly, the invention provides expression cassettes
into which the nucleic acids that encode fusion proteins are incorporated for high level
expression in a desired host cell.

[0101] Expression control sequences that are suitable for use in a particular host cell are
often obtained by cloning a gene that is expressed in that cell. Commonly used prokaryotic
control sequences, which are defined herein to include promoters for transcription initiation,
optionally with an operator, along with ribosome binding site sequences, include such
commonly used promoters as the beta-lactamase (penicillinase) and lactose (lac) promoter
systems (Chang et al., Nature (1977) 198: 1056), the tryptophan (trp) promoter system
(Goeddel et al., Nucleic Acids Res. (1980) 8: 4057), the tac promoter (DeBoer et al., Proc.
Natl. Acad. Sci. U.S.A. (1983) 80: 21-25), and the lambda-derived PL promoter and N-gene
ribosome binding site (Shimatake et al., Nature (1981) 292: 128). The particular promoter
system is not critical to the invention; any available promoter that functions in prokaryotes
can be used.

[0102] For expression of α-1,3/4-fucosyltransferase proteins in prokaryotic cells other than
E. coli, a promoter that functions in the particular prokaryotic species is required. Such
promoters can be obtained from genes that have been cloned from the species, or
heterologous promoters can be used. For example, the hybrid trp-lac promoter functions in
Bacillus in addition to E. coli.

[0103] A ribosome binding site (RBS) is conveniently included in the expression cassettes
of the invention. An RBS in E. coli, for example, consists of a nucleotide sequence 3-9
nucleotides in length located 3-11 nucleotides upstream of the initiation codon (Shine and
Dalgarno, Nature (1975) 254: 34; Steitz, In Biological regulation and development: Gene
expression (ed. R.F. Goldberger), vol. 1, p. 349, 1979, Plenum Publishing, NY).
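
The spacing rule stated above can be checked on a candidate construct with a short script. The following Python sketch is a simplified illustration: it looks for one fixed motif rather than any 3-9 nucleotide RBS, and it measures spacing as the number of nucleotides between the end of the motif and the first base of the initiation codon. The default motif AGGAGG, the function name, and the example sequence are assumptions introduced here.

```python
# Illustrative sketch only; motif, spacing convention, and example are assumptions.
def find_rbs_spacing(seq, start_codon_index, motif="AGGAGG", min_gap=3, max_gap=11):
    """Return the gap between motif and start codon if within range, else None.

    The gap is the number of nucleotides lying between the last base of the
    motif and the first base of the initiation codon.
    """
    for gap in range(min_gap, max_gap + 1):
        end = start_codon_index - gap
        begin = end - len(motif)
        if begin >= 0 and seq[begin:end] == motif:
            return gap
    return None


construct = "TTAGGAGGAACAGCTATGAAA"   # hypothetical 5' region of an expression cassette
start = construct.find("ATG")
spacing = find_rbs_spacing(construct, start)
```

A result of `None` indicates that the motif was not found at an acceptable distance, which might prompt redesign of the region between the promoter and the coding sequence.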

[0104] For expression of the α-1,3/4-fucosyltransferase proteins in yeast, convenient
promoters include GAL1-10 (Johnson and Davies (1984) Mol. Cell. Biol. 4: 1440-1448),
ADH2 (Russell et al. (1983) J. Biol. Chem. 258: 2674-2682), PHO5 (EMBO J. (1982) 6:
675-680), and MFα (Herskowitz and Oshima (1982) in The Molecular Biology of the Yeast
Saccharomyces (eds. Strathern, Jones, and Broach) Cold Spring Harbor Lab., Cold Spring
Harbor, N.Y., pp. 181-209). Another suitable promoter for use in yeast is the ADH2/GAPDH
hybrid promoter as described in Cousens et al., Gene 61: 265-275 (1987). For filamentous
fungi such as, for example, strains of the fungi Aspergillus (McKnight et al., U.S. Patent No.
4,935,349), examples of useful promoters include those derived from Aspergillus nidulans
glycolytic genes, such as the ADH3 promoter (McKnight et al., EMBO J. 4: 2093-2099
(1985)) and the tpiA promoter. An example of a suitable terminator is the ADH3 terminator
(McKnight et al.).

[0105] Either constitutive or regulated promoters can be used in the present invention.
Regulated promoters can be advantageous because the host cells can be grown to high
densities before expression of the fusion proteins is induced. High level expression of
heterologous proteins slows cell growth in some situations. An inducible promoter is a
promoter that directs expression of a gene where the level of expression is alterable by
environmental or developmental factors such as, for example, temperature, pH, anaerobic or
aerobic conditions, light, transcription factors and chemicals. Such promoters are referred to
herein as "inducible" promoters, which allow one to control the timing of expression of the
glycosyltransferase or enzyme involved in nucleotide sugar synthesis. For E. coli and other
bacterial host cells, inducible promoters are known to those of skill in the art. These include,
for example, the lac promoter, the bacteriophage lambda PL promoter, the hybrid trp-lac
promoter (Amann et al. (1983) Gene 25: 167; de Boer et al. (1983) Proc. Nat'l. Acad. Sci.
USA 80: 21), and the bacteriophage T7 promoter (Studier et al. (1986) J. Mol. Biol.; Tabor et
al. (1985) Proc. Nat'l. Acad. Sci. USA 82: 1074-8). These promoters and their use are
discussed in Sambrook et al., supra. A particularly preferred inducible promoter for
expression in prokaryotes is a dual promoter that includes a tac promoter component linked
to a promoter component obtained from a gene or genes that encode enzymes involved in
galactose metabolism (e.g., a promoter from a UDP-galactose 4-epimerase gene (galE)). The
dual tac-gal promoter is described in PCT Patent Application Publ. No. WO98/20111.

[0106] A construct that includes a polynucleotide of interest operably linked to gene
expression control signals that, when placed in an appropriate host cell, drive expression of
the polynucleotide is termed an "expression cassette." Expression cassettes that encode the
fusion proteins of the invention are often placed in expression vectors for introduction into
the host cell. The vectors typically include, in addition to an expression cassette, a nucleic
acid sequence that enables the vector to replicate independently in one or more selected host
cells. Generally, this sequence is one that enables the vector to replicate independently of the
host chromosomal DNA, and includes origins of replication or autonomously replicating
sequences. Such sequences are well known for a variety of bacteria. For instance, the origin
of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria.
Alternatively, the vector can replicate by becoming integrated into the host cell genomic
complement and being replicated as the cell undergoes DNA replication. A preferred
expression vector for expression of the enzymes in bacterial cells is pTGK, which includes
a dual tac-gal promoter and is described in PCT Patent Application Publ. No. WO98/20111.

[0107] The construction of polynucleotide constructs generally requires the use of vectors
able to replicate in bacteria. A plethora of kits are commercially available for the purification
of plasmids from bacteria (see, for example, EasyPrep™ and FlexiPrep™, both from Pharmacia
Biotech; StrataClean™, from Stratagene; and QIAexpress Expression System, Qiagen). The
isolated and purified plasmids can then be further manipulated to produce other plasmids, and
used to transfect cells. Cloning in Streptomyces or Bacillus is also possible.

[0108] Selectable markers are often incorporated into the expression vectors used to
express the polynucleotides of the invention. These genes can encode a gene product, such as
a protein, necessary for the survival or growth of transformed host cells grown in a selective
culture medium. Host cells not transformed with the vector containing the selection gene will
not survive in the culture medium. Typical selection genes encode proteins that confer
resistance to antibiotics or other toxins, such as ampicillin, neomycin, kanamycin,
chloramphenicol, or tetracycline. Alternatively, selectable markers may encode proteins that
complement auxotrophic deficiencies or supply critical nutrients not available from complex
media, e.g., the gene encoding D-alanine racemase for Bacilli. Often, the vector will have
one selectable marker that is functional in, e.g., E. coli, or other cells in which the vector is
replicated prior to being introduced into the host cell. A number of selectable markers are
known to those of skill in the art and are described for instance in Sambrook et al., supra.

[0109] Construction of suitable vectors containing one or more of the above listed
components employs standard ligation techniques as described in the references cited above.
Isolated plasmids or DNA fragments are cleaved, tailored, and re-ligated in the form desired
to generate the plasmids required. To confirm correct sequences in plasmids constructed, the
plasmids can be analyzed by standard techniques such as by restriction endonuclease
digestion, and/or sequencing according to known methods. Molecular cloning techniques to
achieve these ends are known in the art. A wide variety of cloning and in vitro amplification
methods suitable for the construction of recombinant nucleic acids are well-known to persons
of skill. Examples of these techniques and instructions sufficient to direct persons of skill
through many cloning exercises are found in Berger and Kimmel, Guide to Molecular
Cloning Techniques, Methods in Enzymology, Volume 152, Academic Press, Inc., San Diego,
CA (Berger); and Current Protocols in Molecular Biology, F.M. Ausubel et al., eds., Current
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley &
Sons, Inc., (1998 Supplement) (Ausubel).
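
Confirmation of a construct by restriction endonuclease digestion, as mentioned above, amounts to comparing observed fragment sizes with those predicted from the expected sequence. The following Python sketch predicts fragment sizes for a circular plasmid cut with a single enzyme. It is a simplified illustration: the function name and the toy plasmid are hypothetical, and recognition sites that span the arbitrary start of the sequence string are not detected.

```python
# Illustrative sketch only; the toy plasmid below is not a real vector sequence.
def circular_digest_sizes(plasmid, site):
    """Predict fragment sizes (largest first) for a circular plasmid.

    For a single enzyme, the cut offset within the recognition site shifts
    every cut by the same amount, so distances between site starts give the
    fragment lengths directly.
    """
    n = len(plasmid)
    positions = []
    i = plasmid.find(site)
    while i != -1:
        positions.append(i)
        i = plasmid.find(site, i + 1)
    if not positions:
        return [n]  # uncut plasmid; it would remain circular
    wrapped = positions[1:] + [positions[0] + n]
    sizes = [b - a for a, b in zip(positions, wrapped)]
    return sorted(sizes, reverse=True)


HINDIII = "AAGCTT"
toy_plasmid = HINDIII + "G" * 94 + HINDIII + "C" * 44   # 150 bp, two HindIII sites
expected_fragments = circular_digest_sizes(toy_plasmid, HINDIII)
```

A mismatch between the predicted list and the bands seen on a gel would suggest an incorrect insert, orientation, or rearrangement, at which point sequencing is the more definitive check.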

[0110] A variety of common vectors suitable for use as starting materials for constructing
the expression vectors of the invention are well known in the art. For cloning in bacteria,
common vectors include pBR322 derived vectors such as pBLUESCRIPT™, and λ-phage
derived vectors. In yeast, vectors include Yeast Integrating plasmids (e.g., YIp5) and Yeast
Replicating plasmids (the YRp series plasmids) and pGPD-2. Expression in mammalian cells
can be achieved using a variety of commonly available plasmids, including pSV2, pBC12BI,
and p91023, as well as lytic virus vectors (e.g., vaccinia virus, adenovirus, and baculovirus),
episomal virus vectors (e.g., bovine papillomavirus), and retroviral vectors (e.g., murine
retroviruses).

[0111] The methods for introducing the expression vectors into a chosen host cell are not
particularly critical, and such methods are known to those of skill in the art. For example, the
expression vectors can be introduced into prokaryotic cells, including E. coli, by calcium
chloride transformation, and into eukaryotic cells by calcium phosphate treatment or
electroporation. Other transformation methods are also suitable.

[0112] Translational coupling may be used to enhance expression. The strategy uses a
short upstream open reading frame derived from a highly expressed gene native to the
translational system, which is placed downstream of the promoter, and a ribosome binding
site followed after a few amino acid codons by a termination codon. Just prior to the
termination codon is a second ribosome binding site, and following the termination codon is a
start codon for the initiation of translation. The system dissolves secondary structure in the
RNA, allowing for the efficient initiation of translation. See Squires et al. (1988) J. Biol.
Chem. 263: 16297-16302.

[0113] The α-1,3/4-fucosyltransferase proteins can be expressed intracellularly, or can be
secreted from the cell. Intracellular expression often results in high yields. If necessary, the
amount of soluble, active fusion protein may be increased by performing refolding
procedures (see, e.g., Sambrook et al., supra; Marston et al., Bio/Technology (1984) 2: 800;
Schoner et al., Bio/Technology (1985) 3: 151). In embodiments in which the
α-1,3/4-fucosyltransferase proteins are secreted from the cell, either into the periplasm or
into the extracellular medium, the DNA sequence is linked to a cleavable signal peptide
sequence. The signal sequence directs translocation of the fusion protein through the cell
membrane. An example of a suitable vector for use in E. coli that contains a promoter-signal
sequence unit is pTA1529, which has the E. coli phoA promoter and signal sequence (see, e.g.,
Sambrook et al., supra; Oka et al., Proc. Natl. Acad. Sci. USA (1985) 82: 7212; Talmadge et
al., Proc. Natl. Acad. Sci. USA (1980) 77: 3988; Takahara et al., J. Biol. Chem. (1985) 260:
2670). In another embodiment, the fusion proteins are fused to a subsequence of protein A or
bovine serum albumin (BSA), for example, to facilitate purification, secretion, or stability.

[0114] The α-1,3/4-fucosyltransferase proteins of the invention can also be further linked to
other bacterial proteins. This approach often results in high yields, because normal
prokaryotic control sequences direct transcription and translation. In E. coli, lacZ fusions are
often used to express heterologous proteins. Suitable vectors are readily available, such as
the pUR, pEX, and pMR100 series (see, e.g., Sambrook et al., supra). For certain
applications, it may be desirable to cleave the non-glycosyltransferase and/or accessory
enzyme amino acids from the fusion protein after purification. This can be accomplished by
any of several methods known in the art, including cleavage by cyanogen bromide, a
protease, or by Factor Xa (see, e.g., Sambrook et al., supra; Itakura et al., Science (1977)
198: 1056; Goeddel et al., Proc. Natl. Acad. Sci. USA (1979) 76: 106; Nagai et al., Nature
(1984) 309: 810; Sung et al., Proc. Natl. Acad. Sci. USA (1986) 83: 561). Cleavage sites can
be engineered into the gene for the fusion protein at the desired point of cleavage.

[0115] More than one recombinant protein may be expressed in a single host cell by
placing multiple transcriptional cassettes in a single expression vector, or by utilizing
different selectable markers for each of the expression vectors which are employed in the
cloning strategy.

[0116] A suitable system for obtaining recombinant proteins from E. coli which maintains
the integrity of their N-termini has been described by Miller et al., Biotechnology 7: 698-704
(1989). In this system, the gene of interest is produced as a C-terminal fusion to the first 76
residues of the yeast ubiquitin gene containing a peptidase cleavage site. Cleavage at the
junction of the two moieties results in production of a protein having an intact authentic
N-terminal residue.

D. Purification of α-1,3/4-fucosyltransferase proteins

[0117] The H. pylori fucosyltransferase proteins of the present invention can be expressed
as intracellular proteins or as proteins that are secreted from the cell, and can be used in this
form in the methods of the present invention. For example, a crude cellular extract
containing the expressed intracellular or secreted H. pylori fucosyltransferase protein can be
used in the methods of the present invention.

[0118] Alternatively, the H. pylori fucosyltransferase proteins can be purified according to
standard procedures of the art, including ammonium sulfate precipitation, affinity columns,
column chromatography, gel electrophoresis and the like (see, generally, R. Scopes, Protein
Purification, Springer-Verlag, N.Y. (1982); Deutscher, Methods in Enzymology Vol. 182:
Guide to Protein Purification, Academic Press, Inc., N.Y. (1990)). Substantially pure
compositions of at least about 70 to 90% homogeneity are preferred, and 98 to 99% or more
homogeneity are most preferred. The purified proteins may also be used, e.g., as
immunogens for antibody production.

[0119] To facilitate purification of the H. pylori α-1,3/4-fucosyltransferase proteins of the
invention, the nucleic acids that encode the fusion proteins can also include a coding
sequence for an epitope or "tag" for which an affinity binding reagent is available, i.e., a
purification tag. Examples of suitable epitopes include the myc and V-5 reporter genes;
expression vectors useful for recombinant production of fusion proteins having these epitopes
are commercially available (e.g., Invitrogen (Carlsbad CA) vectors pcDNA3.1/Myc-His and
pcDNA3.1/V5-His are suitable for expression in mammalian cells). Additional expression
vectors suitable for attaching a tag to the H. pylori α-1,3/4-fucosyltransferase proteins of the
invention, and corresponding detection systems, are known to those of skill in the art, and
several are commercially available (e.g., FLAG™ (Kodak, Rochester NY)). Another example
of a suitable tag is a polyhistidine sequence, which is capable of binding to metal chelate
affinity ligands. Typically, six adjacent histidines are used, although one can use more or
fewer than six. Suitable metal chelate affinity ligands that can serve as the binding moiety
for a polyhistidine tag include nitrilo-tri-acetic acid (NTA) (Hochuli, E. (1990) "Purification
of recombinant proteins with metal chelating adsorbents" In Genetic Engineering: Principles
and Methods, J.K. Setlow, Ed., Plenum Press, NY; commercially available from Qiagen
(Santa Clarita, CA)).
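
At the sequence level, adding a polyhistidine tag means inserting a run of histidine codons in frame with the coding sequence. The following Python sketch appends a C-terminal tag immediately before the stop codon, if one is present. It is a hedged illustration: the function name, the choice of the CAC codon (CAT also encodes histidine), the C-terminal placement, and the example sequence are assumptions introduced here rather than features of the disclosure.

```python
# Illustrative sketch only; codon choice and tag placement are assumptions.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def add_his_tag(cds, n_his=6, his_codon="CAC"):
    """Return the coding sequence with n_his histidine codons added in frame.

    The tag is placed at the 3' end, before the stop codon when one is present.
    """
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    if cds[-3:] in STOP_CODONS:
        body, stop = cds[:-3], cds[-3:]
    else:
        body, stop = cds, ""
    return body + his_codon * n_his + stop


tagged = add_his_tag("ATGAAATAA")   # hypothetical three-codon example
```

The `n_his` parameter reflects the statement above that more or fewer than six histidines can be used.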

[0120] Purification tags also include maltose binding domains and starch binding domains.
Purification of maltose binding domain proteins is known to those of skill in the art. Starch
binding domains are described in WO 99/15636, herein incorporated by reference. Affinity
purification of a fusion protein comprising a starch binding domain using a beta-cyclodextrin
(BCD)-derivatized resin is described in USSN 60/468,374, filed May 5, 2003, herein
incorporated by reference in its entirety.

[0121] Other haptens that are suitable for use as tags are known to those of skill in the art
and are described, for example, in the Handbook of Fluorescent Probes and Research
Chemicals (6th Ed., Molecular Probes, Inc., Eugene OR). For example, dinitrophenol (DNP),
digoxigenin, barbiturates (see, e.g., US Patent No. 5,414,085), and several types of
fluorophores are useful as haptens, as are derivatives of these compounds. Kits are
commercially available for linking haptens and other moieties to proteins and other
molecules. For example, where the hapten includes a thiol, a heterobifunctional linker such as
SMCC can be used to attach the tag to lysine residues present on the capture reagent.

[0122] One of skill would recognize that modifications can be made to the
α-1,3/4-fucosyltransferase catalytic or functional domains without diminishing their
biological activity. Some modifications may be made to facilitate the cloning, expression, or
incorporation of the catalytic domain into a fusion protein. Such modifications are well
known to those of skill in the art and include, for example, the addition of codons at either
terminus of the polynucleotide that encodes the catalytic domain to provide, for example, a
methionine added at the amino terminus to provide an initiation site, or additional amino
acids (e.g., poly His) placed on either terminus to create conveniently located restriction
enzyme sites or termination codons or purification sequences.

E. Uses of the H. pylori fucosyltransferase proteins

[0123] The invention provides H. pylori α-1,3/4-fucosyltransferase proteins and methods of
using the H. pylori α-1,3/4-fucosyltransferase proteins to enzymatically synthesize
glycoproteins, glycolipids, and oligosaccharide moieties. The glycosyltransferase reactions
of the invention take place in a reaction medium comprising at least one H. pylori
α-1,3/4-fucosyltransferase, acceptor substrate, and donor substrate, and typically a soluble
divalent metal cation. In some embodiments, accessory enzymes and substrates for the
accessory enzyme catalytic moiety are also present, so that the accessory enzymes can
synthesize the donor substrate for the H. pylori α-1,3/4-fucosyltransferase.

[0124] A number of methods of using glycosyltransferases to synthesize glycoproteins and 
glycolipids having desired oligosaccharide moieties are known. Exemplary methods are 
described, for instance, in WO 96/32491, Ito et al. (1993) Pure Appl. Chem. 65: 753, and U.S.
Patents 5,352,670, 5,374,541, and 5,545,553.

[0125] The H. pylori fucosyltransferase proteins prepared as described herein can be used 
in combination with additional glycosyltransferases. For example, one can use a combination 
of a recombinant sialyltransferase fusion protein and a recombinant H. pylori
α-1,3/4-fucosyltransferase. By conducting two glycosyltransferase reactions in sequence in a single
vessel, overall yields are improved over procedures in which an intermediate species is
isolated. Moreover, cleanup and disposal of extra solvents and by-products is reduced.
Similarly, the recombinant glycosyltransferases can be used with a recombinant accessory
enzyme, which may or may not be present as a fusion protein. In other embodiments, the
H. pylori α-1,3/4-fucosyltransferase and additional glycosyltransferases or accessory enzymes
are produced in the same cell and used to synthesize a desired end product.

[0126] The products produced by the above processes can be used without purification. 
However, standard, well known techniques, for example, thin or thick layer chromatography, 
ion exchange chromatography, or membrane filtration can be used for recovery of
glycosylated saccharides. Also, for example, membrane filtration, utilizing a nanofiltration
or reverse osmotic membrane as described in commonly assigned AU Patent No. 735695
may be used. As a further example, membrane filtration wherein the membranes have a
molecular weight cutoff of about 1000 to about 10,000 Daltons can be used to remove
proteins. As another example, nanofiltration or reverse osmosis can then be used to remove
salts. Nanofilter membranes are a class of reverse osmosis membranes which pass
monovalent salts but retain polyvalent salts and uncharged solutes larger than about 200 to
about 1000 Daltons, depending upon the membrane used. Thus, for example, the 
oligosaccharides produced by the compositions and methods of the present invention can be 
retained in the membrane and contaminating salts will pass through. 
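
The two-stage cleanup described above (protein removal with a membrane having a cutoff of about 1000 to about 10,000 Daltons, followed by salt removal by nanofiltration) can be sketched as a simple classification. This is an illustrative model only: the `Solute` type, the function name, the default cutoffs picked from within the stated ranges, and the example masses are all assumptions, and real membranes do not separate this sharply.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Solute:
    name: str
    mass_da: float  # approximate molecular mass in Daltons
    kind: str       # "protein", "uncharged", "monovalent_salt", or "polyvalent_salt"


def two_stage_filtration(solutes: List[Solute],
                         protein_cutoff_da: float = 10_000.0,
                         nanofilter_cutoff_da: float = 200.0) -> List[Solute]:
    """Return the solutes held back by the nanofilter after protein removal.

    Stage 1: a membrane with the given molecular weight cutoff holds back
    proteins; everything smaller passes into the permeate.
    Stage 2: the nanofilter passes monovalent salts and retains polyvalent
    salts and uncharged solutes larger than its cutoff.
    """
    permeate = [s for s in solutes if s.mass_da < protein_cutoff_da]
    retained = []
    for s in permeate:
        if s.kind == "polyvalent_salt":
            retained.append(s)
        elif s.kind == "uncharged" and s.mass_da > nanofilter_cutoff_da:
            retained.append(s)
    return retained


# Hypothetical reaction mixture; masses are rough illustrative values.
mixture = [
    Solute("enzyme", 50_000.0, "protein"),
    Solute("oligosaccharide product", 850.0, "uncharged"),
    Solute("sodium chloride", 58.4, "monovalent_salt"),
]
retained_names = [s.name for s in two_stage_filtration(mixture)]
print(retained_names)
```

In this simplified picture only the oligosaccharide is retained, which matches the statement that the product is held back while contaminating salts pass through.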

F. Donor Substrates and Acceptor Substrates

[0127] Suitable donor substrates used by the H. pylori fucosyltransferase proteins and other 
glycosyltransferases in the methods of the invention include, but are not limited to, UDP-Glc,
UDP-GlcNAc, UDP-Gal, UDP-GalNAc, GDP-Man, GDP-Fuc, UDP-GlcUA, and CMP-sialic
acid. See Guo et al., Applied Biochem. and Biotech. 68: 1-20 (1997).

[0128] Suitable acceptor substrates used by the H. pylori fucosyltransferase proteins and
methods of the invention include, but are not limited to, polysaccharides, oligosaccharides,
lipids, and glycolipids. For example, the oligosaccharide LNnT can be fucosylated to form
LNFIII. The fucosyltransferases described herein can also be used in multienzyme systems to
produce a desired product from a convenient starting material. For example, LNFIII was
prepared on a multigram scale from lactose using the H. pylori α-1,3/4-fucosyltransferases
from strain 1182 described herein, in combination with Neisseria gonococcus
β-1,3-N-acetylglucosaminyltransferase (lgtA) and Neisseria gonococcus β-1,4-galactosyltransferase
(lgtB).

[0129] Suitable acceptor substrates used by the H. pylori fucosyltransferase proteins and
methods of the invention include, but are not limited to, proteins, lipids, gangliosides and
other biological structures (e.g., whole cells) that can be modified by the methods of the
invention. Exemplary structures that can be modified by the methods of the invention
include any of a number of glycolipids, glycoproteins and carbohydrate structures on cells
known to those skilled in the art, as set forth in Table 1.

Table 1

Hormones and Growth Factors
• G-CSF
• GM-CSF
• TPO
• EPO
• EPO variants
• α-TNF
• Leptin

Enzymes and Inhibitors
• t-PA
• t-PA variants
• Urokinase
• Factors VII, VIII, IX, X
• Glucocerebrosidase
• Hirudin
• α1 antitrypsin
• Antithrombin III

Cytokines and Chimeric Cytokines
• Interleukin-1 (IL-1), 1B, 2, 3, 4
• Interferon-α (IFN-α)
• IFN-α-2b
• IFN-β
• IFN-γ
• Chimeric diphtheria toxin-IL-2

Receptors and Chimeric Receptors
• CD4
• Tumor Necrosis Factor (TNF) receptor
• Alpha-CD20
• MAb-CD20
• MAb-alpha-CD3
• MAb-TNF receptor
• MAb-CD4
• PSGL-1
• MAb-PSGL-1
• Complement
• GlyCAM or its chimera
• N-CAM or its chimera
• LFA-3
• CTLA-IV

Monoclonal Antibodies (Immunoglobulins)
• MAb-anti-RSV
• MAb-anti-IL-2 receptor
• MAb-anti-CEA
• MAb-anti-platelet IIb/IIIa receptor
• MAb-anti-EGF
• MAb-anti-Her-2 receptor

Cells
• Red blood cells
• White blood cells (e.g., T cells, B cells, dendritic cells, macrophages, NK cells, neutrophils, monocytes, and the like)
• Stem cells

[0130] Examples of suitable acceptor substrates used in fucosyltransferase-catalyzed
reactions, and examples of suitable acceptor substrates used in sialyltransferase-catalyzed
reactions are described in Guo et al., Applied Biochem. and Biotech. 68: 1-20 (1997), but are
not limited thereto.

[0131] The present invention provides H. pylori fucosyltransferase proteins (e.g.,
fucosyltransferases) that are selected for their ability to produce glycoproteins and glycolipids
having desired oligosaccharide moieties. Similarly, if present, accessory enzymes are chosen
based on a desired activated sugar substrate or on a sugar found on the product
oligosaccharide.

[0132] One can readily identify suitable H. pylori fucosyltransferase proteins by reacting
various amounts of an H. pylori α-1,3/4-fucosyltransferase protein of interest (e.g., 0.01-100
mU/mg protein) with a glycoprotein (e.g., at 1-10 mg/ml) to which is linked an
oligosaccharide that has a potential acceptor site for glycosylation by the fusion protein of
interest. The abilities of the recombinant glycosyltransferase fusion proteins of the present
invention to add a sugar residue at the desired acceptor site are compared, and an H. pylori
fucosyltransferase protein having the desired property (e.g., acceptor substrate specificity or
catalytic activity) is selected.
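
The comparison step above can be pictured as ranking candidates by a measured property and keeping the best one. The sketch below is hypothetical throughout: the candidate names, the measured values, the threshold, and the function name are invented for illustration, and it assumes each candidate was assayed at the same enzyme dose and glycoprotein concentration.

```python
from typing import Dict, Optional


def select_enzyme(fraction_glycosylated: Dict[str, float],
                  minimum_fraction: float = 0.0) -> Optional[str]:
    """Pick the candidate that glycosylated the largest fraction of acceptor sites.

    Returns None when no candidate reaches minimum_fraction.
    """
    eligible = {name: value for name, value in fraction_glycosylated.items()
                if value >= minimum_fraction}
    if not eligible:
        return None
    return max(eligible, key=lambda name: eligible[name])


# Hypothetical assay results: fraction of acceptor sites glycosylated by
# each candidate under identical conditions.
assay = {"candidate_A": 0.42, "candidate_B": 0.81, "candidate_C": 0.65}
best = select_enzyme(assay, minimum_fraction=0.5)
print(best)
```

The same pattern applies if the property being compared is acceptor substrate specificity rather than extent of glycosylation; only the measured values change.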

[0133] In general, the efficacy of the enzymatic synthesis of glycoproteins and glycolipids, 
having desired oligosaccharide moieties, can be enhanced through use of recombinantly 
produced H. pylori α-1,3/4-fucosyltransferase proteins of the present invention. Recombinant
techniques enable production of the recombinant H. pylori α-1,3/4-fucosyltransferase proteins
in the large amounts that are required for large-scale glycoprotein and glycolipid 
modification. 

[0134] Suitable glycoproteins and glycolipids for use by the H. pylori fucosyltransferase
proteins and methods of the invention can be glycoproteins and glycolipids immobilized on a
solid support during the glycosylation reaction. The term "solid support" also encompasses
semi-solid supports. Preferably, the target glycoprotein or glycolipid is reversibly
immobilized so that the respective glycoprotein or glycolipid can be released after the
glycosylation reaction is completed. Many suitable matrices are known to those of skill in
the art. Ion exchange, for example, can be employed to temporarily immobilize a
glycoprotein or glycolipid on an appropriate resin while the glycosylation reaction proceeds.
A ligand that specifically binds to the glycoprotein or glycolipid of interest can also be used
for affinity-based immobilization. For example, antibodies that specifically bind to a 
glycoprotein are suitable. Also, where the glycoprotein of interest is itself an antibody or 
contains a fragment thereof, one can use protein A or G as the affinity resin. Dyes and other
molecules that specifically bind to a glycoprotein or glycolipid of interest are also suitable. 

[0135] The recombinant fusion protein of the invention can be constructed and expressed
as a fusion protein with a molecular "tag" at one end, which facilitates purification of the
protein, i.e., a purification tag. Such tags can also be used for immobilization of a protein of
interest during the glycosylation reaction. Suitable tags include "epitope tags," which are
protein sequences that are specifically recognized by an antibody. Epitope tags are generally
incorporated into fusion proteins to enable the use of a readily available antibody to
unambiguously detect or isolate the fusion protein. A "FLAG tag" is a commonly used
epitope tag, specifically recognized by a monoclonal anti-FLAG antibody, consisting of the
sequence AspTyrLysAspAspAspAspLys or a substantially identical variant thereof. A myc
tag is another commonly used epitope tag. Other suitable tags are known to those of skill in
the art, and include, for example, an affinity tag such as a hexahistidine peptide, which will
bind to metal ions such as nickel or cobalt ions. Purification tags also include maltose
binding domains and starch binding domains. Purification of maltose binding domain
proteins is known to those of skill in the art. Starch binding domains are described in WO
99/15636, herein incorporated by reference. Affinity purification of a fusion protein
comprising a starch binding domain using a beta-cyclodextrin (BCD)-derivatized resin is
described in USSN 60/468,374, filed May 5, 2003, herein incorporated by reference in its
entirety.

[0136] When the glycoprotein is a truncated version of the full-length
glycoprotein, it preferably includes the biologically active subsequence of the full-length
glycoprotein. Exemplary biologically active subsequences include, but are not limited to,
enzyme active sites, receptor binding sites, ligand binding sites, complementarity determining
regions of antibodies, and antigenic regions of antigens.

[0137] In some embodiments, the H. pylori fucosyltransferase proteins and methods of the
present invention are used to enzymatically synthesize a glycoprotein or glycolipid that has a
substantially uniform glycosylation pattern. The glycoproteins and glycolipids include a
saccharide or oligosaccharide that is attached to a protein, glycoprotein, lipid, or glycolipid
for which a glycoform alteration is desired. The saccharide or oligosaccharide includes a
structure that can function as an acceptor substrate for a glycosyltransferase. When the
acceptor substrate is glycosylated, the desired oligosaccharide moiety is formed. The desired
oligosaccharide moiety is one that imparts the desired biological activity upon the
glycoprotein or glycolipid to which it is attached. In the compositions of the invention, the
preselected saccharide residue is linked to at least about 30% of the potential acceptor sites of
interest. More preferably, the preselected saccharide residue is linked to at least about 50%
of the potential acceptor substrates of interest, and still more preferably to at least 70% of the
potential acceptor substrates of interest. In situations in which the starting glycoprotein or
glycolipid exhibits heterogeneity in the oligosaccharide moiety of interest (e.g., some of the
oligosaccharides on the starting glycoprotein or glycolipid already have the preselected
saccharide residue attached to the acceptor substrate of interest), the recited percentages
include such pre-attached saccharide residues.
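
The way the recited percentages are counted can be made concrete with a short calculation. The sketch below is an assumption-laden illustration: the function names and the example site counts are invented, and the only rules taken from the text are that pre-attached residues count toward the total and that the thresholds are about 30%, about 50%, and 70%.

```python
def acceptor_site_occupancy(newly_glycosylated: int,
                            pre_attached: int,
                            total_sites: int) -> float:
    """Percent of potential acceptor sites carrying the preselected residue.

    Residues already present on the starting material are counted together
    with residues added by the enzyme.
    """
    if total_sites <= 0:
        raise ValueError("total_sites must be positive")
    occupied = newly_glycosylated + pre_attached
    if newly_glycosylated < 0 or pre_attached < 0 or occupied > total_sites:
        raise ValueError("site counts are inconsistent")
    return 100.0 * occupied / total_sites


def preference_tier(percent: float) -> str:
    """Map an occupancy percentage to the thresholds recited in the text."""
    if percent >= 70.0:
        return "at least 70% (still more preferred)"
    if percent >= 50.0:
        return "at least about 50% (more preferred)"
    if percent >= 30.0:
        return "at least about 30%"
    return "below 30%"


# Hypothetical counts: 100 potential sites, 15 already occupied in the
# starting material, 40 newly glycosylated by the enzyme.
percent = acceptor_site_occupancy(newly_glycosylated=40, pre_attached=15, total_sites=100)
print(percent, preference_tier(percent))
```

Because the thresholds are stated as "about", the hard cutoffs in `preference_tier` are a simplification.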

[0138] The term "altered" refers to the glycoprotein or glycolipid of interest having a
glycosylation pattern that, after application of the H. pylori fucosyltransferase proteins and
methods of the invention, is different from that observed on the glycoprotein as originally
produced. Examples of such glycoconjugates are glycoproteins in which the glycoforms of
the glycoproteins are different from those found on the glycoprotein when it is produced by
cells of the organism to which the glycoprotein is native. Also provided are H. pylori
fucosyltransferase proteins and methods of using such fusion proteins for enzymatically
synthesizing glycoproteins and glycolipids in which the glycosylation pattern of these
glycoconjugates is modified compared to the glycosylation pattern of the glycoconjugates as
originally produced by a host cell, which can be of the same or a different species than the
cells from which the native glycoconjugates are produced.

[0139] One can assess differences in glycosylation patterns not only by structural analysis
of the glycoproteins and glycolipids, but also by comparison of one or more biological
activities of the glycoconjugates. For example, a glycoprotein having an "altered glycoform"
includes one that exhibits an improvement in one or more biological activities of the
glycoprotein after the glycosylation reaction compared to the unmodified glycoprotein. For
example, an altered glycoconjugate includes one that, after application of the H. pylori
fucosyltransferase proteins and methods of the invention, exhibits a greater binding affinity
for a ligand or receptor of interest, a greater therapeutic half-life, reduced antigenicity, and
targeting to specific tissues. The amount of improvement observed is preferably statistically
significant, and is more preferably at least about a 25% improvement, and still more
preferably is at least about 50%, 60%, 70%, and even still more preferably is at least 80%.

G. Fucosyltransferase reactions 
[0140] The H. pylori fucosyltransferase proteins, acceptor substrates, donor substrates and
other reaction mixture ingredients, including other glycosyltransferases and accessory
enzymes, are combined by admixture in an aqueous reaction medium. The medium generally
has a pH value of about 5 to about 8.5. The selection of a medium is based on the ability of
the medium to maintain pH value at the desired level. Thus, in some embodiments, the
medium is buffered to a pH value of about 7.5. If a buffer is not used, the pH of the medium
should be maintained at about 5 to 8.5, depending upon the particular glycosyltransferase
used. For fucosyltransferases, the pH range is preferably maintained from about 6.0 to 8.0.
For sialyltransferases, the range is preferably from about 5.5 to about 7.5.

[0141] Enzyme amounts or concentrations are expressed in activity units, which are a
measure of the initial rate of catalysis. One activity unit catalyzes the formation of 1 µmol of
product per minute at a given temperature (typically 37°C) and pH value (typically 7.5).
Thus, 10 units of an enzyme is a catalytic amount of that enzyme where 10 µmol of substrate
are converted to 10 µmol of product in one minute at a temperature of 37°C and a pH value
of 7.5.
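
The unit definition above translates directly into arithmetic. The following sketch uses hypothetical helper names and assumes the initial rate holds for the whole incubation, which in practice it does not once substrate runs low; it is meant only to show how units, time, and product relate.

```python
from typing import Optional


def product_formed_umol(units: float, minutes: float,
                        substrate_umol: Optional[float] = None) -> float:
    """Estimate product formed, assuming the initial rate holds throughout.

    One unit forms 1 µmol of product per minute, so the estimate is
    units * minutes, capped by the available substrate when that is given.
    """
    if units < 0 or minutes < 0:
        raise ValueError("units and minutes must be non-negative")
    formed = units * minutes
    if substrate_umol is not None:
        formed = min(formed, substrate_umol)
    return formed


def units_required(target_umol: float, minutes: float) -> float:
    """Units needed to form target_umol of product in the given time."""
    if minutes <= 0:
        raise ValueError("minutes must be positive")
    return target_umol / minutes


print(product_formed_umol(10, 1))                    # the 10-unit case from the text
print(product_formed_umol(10, 5, substrate_umol=30)) # limited by substrate
print(units_required(600, 60))                       # hypothetical 600 µmol in one hour
```

The cap on `substrate_umol` reflects that a catalytic amount of enzyme cannot form more product than there is substrate to convert.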

[0142] The reaction mixture may include divalent metal cations (Mg²⁺, Mn²⁺). The
reaction medium may also comprise solubilizing detergents (e.g., Triton or SDS) and organic
solvents such as methanol or ethanol, if necessary. The enzymes can be utilized free in
solution or can be bound to a support such as a polymer. The reaction mixture is thus
substantially homogeneous at the beginning, although some precipitate can form during the
reaction.

[0143] The temperature at which an above process is carried out can range from just above
freezing to the temperature at which the most sensitive enzyme denatures. That temperature
range is preferably about 0°C to about 45°C, and more preferably about 20°C to about
37°C.

[0144] The reaction mixture so formed is maintained for a period of time sufficient to 
obtain the desired high yield of desired oligosaccharide products, including determinants
present on oligosaccharide groups attached to the glycoprotein to be glycosylated. For
large-scale preparations, the reaction will often be allowed to proceed for between about 0.5-240
hours, and more typically between about 1-18 hours. 
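
The pH, temperature, and time ranges recited in this section can be collected into a small check. The sketch below is illustrative only: the function and constant names are assumptions, the recited "about" limits are treated as hard boundaries for simplicity, and the labels "preferred", "acceptable", and "outside" are invented shorthand rather than terms from the text.

```python
from typing import Dict, Tuple

Range = Tuple[float, float]

PH_GENERAL: Range = (5.0, 8.5)
PH_PREFERRED: Dict[str, Range] = {
    "fucosyltransferase": (6.0, 8.0),
    "sialyltransferase": (5.5, 7.5),
}
TEMP_C_GENERAL: Range = (0.0, 45.0)
TEMP_C_PREFERRED: Range = (20.0, 37.0)
HOURS_GENERAL: Range = (0.5, 240.0)   # large-scale preparations
HOURS_TYPICAL: Range = (1.0, 18.0)


def _classify(value: float, general: Range, preferred: Range) -> str:
    """Label a value as preferred, acceptable, or outside the recited ranges."""
    if preferred[0] <= value <= preferred[1]:
        return "preferred"
    if general[0] <= value <= general[1]:
        return "acceptable"
    return "outside"


def classify_conditions(ph: float, temp_c: float, hours: float,
                        enzyme: str = "fucosyltransferase") -> Dict[str, str]:
    """Classify proposed reaction conditions against the recited ranges."""
    return {
        "pH": _classify(ph, PH_GENERAL, PH_PREFERRED[enzyme]),
        "temperature": _classify(temp_c, TEMP_C_GENERAL, TEMP_C_PREFERRED),
        "time": _classify(hours, HOURS_GENERAL, HOURS_TYPICAL),
    }


print(classify_conditions(7.5, 37, 12))
print(classify_conditions(8.2, 4, 48))
```

A check of this kind only screens a proposed protocol against the stated ranges; the appropriate conditions for a given enzyme still have to be determined experimentally.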

[0145] In embodiments in which more than one glycosyltransferase is used to obtain the 
oligosaccharide products, the enzymes and reagents for a second glycosyltransferase reaction 
can be added to the reaction medium once the first glycosyltransferase reaction has neared 
completion. For some combinations of enzymes, the glycosyltransferases and corresponding 
substrates can be combined in a single initial reaction mixture; the enzymes in such 
simultaneous reactions preferably do not form a product that cannot serve as an acceptor for
the other enzyme. By conducting two glycosyltransferase reactions in sequence in a single
vessel, overall yields are improved over procedures in which an intermediate species is
isolated. Moreover, cleanup and disposal of extra solvents and by-products is reduced. In
addition, in some embodiments, the fucosyltransferase and additional glycosyltransferases
or accessory enzymes are expressed in the same host cell and the desired product is 
synthesized within the host cell. 

[0146] One or more of the glycosyltransferase reactions can be carried out as part of a
glycosyltransferase cycle. Preferred conditions and descriptions of glycosyltransferase cycles
have been described. A number of glycosyltransferase cycles (for example, sialyltransferase
cycles, galactosyltransferase cycles, and fucosyltransferase cycles) are described in U.S.
Patent No. 5,374,541 and WO 94/25615 A. Other glycosyltransferase cycles are described in
Ichikawa et al., J. Am. Chem. Soc. 114:9283 (1992), Wong et al., J. Org. Chem. 57: 4343
(1992), DeLuca et al., J. Am. Chem. Soc. 117:5869-5870 (1995), and Ichikawa et al. in
Carbohydrates and Carbohydrate Polymers, Yalpani, ed. (ATL Press, 1993).

[0147] Other glycosyltransferases can be substituted into similar transferase cycles as have 
been described in detail for the fucosyltransferases and sialyltransferases. In particular, the 
glycosyltransferase can also be, for instance, glucosyltransferases, e.g., Alg8 (Stagljar et al.,
Proc. Natl. Acad. Sci. USA 91:5977 (1994)) or Alg5 (Heesen et al., Eur. J. Biochem. 224:71
(1994)), N-acetylgalactosaminyltransferases such as, for example, α(1,3)
N-acetylgalactosaminyltransferase, β(1,4) N-acetylgalactosaminyltransferases (Nagata et al., J.
Biol. Chem. 267:12082-12089 (1992) and Smith et al., J. Biol. Chem. 269:15162 (1994)) and
polypeptide N-acetylgalactosaminyltransferase (Homa et al., J. Biol. Chem. 268:12609
(1993)). Suitable N-acetylglucosaminyltransferases include GnTI (2.4.1.101, Hull et al.,
BBRC 176:608 (1991)), GnTII, and GnTIII (Ihara et al., J. Biochem. 113:692 (1993)), GnTV
(Shoreibah et al., J. Biol. Chem. 268: 15381 (1993)), O-linked
N-acetylglucosaminyltransferase (Bierhuizen et al., Proc. Natl. Acad. Sci. USA 89:9326 (1992)),
N-acetylglucosamine-1-phosphate transferase (Rajput et al., Biochem. J. 285:985 (1992)), and
hyaluronan synthase. Suitable mannosyltransferases include α(1,2) mannosyltransferase,
α(1,3) mannosyltransferase, β(1,4) mannosyltransferase, Dol-P-Man synthase, OCh1, and
Pmt1.

[0148] For the above glycosyltransferase cycles, the concentrations or amounts of the
various reactants used in the processes depend upon numerous factors including reaction
conditions such as temperature and pH value, and the choice and amount of acceptor
saccharides to be glycosylated. Because the glycosylation process permits regeneration of
activating nucleotides, activated donor sugars and scavenging of produced PPi in the
presence of catalytic amounts of the enzymes, the process is limited by the concentrations or
amounts of the stoichiometric substrates discussed before. The upper limit for the
concentrations of reactants that can be used in accordance with the method of the present
invention is determined by the solubility of such reactants.

[0149] Preferably, the concentrations of activating nucleotides, phosphate donor, the donor
sugar and enzymes are selected such that glycosylation proceeds until the acceptor is 
consumed. 

[0150] Each of the enzymes is present in a catalytic amount. The catalytic amount of a 
particular enzyme varies according to the concentration of that enzyme's substrate as well as
to reaction conditions such as temperature, time and pH value. Means for determining the 
catalytic amount for a given enzyme under preselected substrate concentrations and reaction 
conditions are well known to those of skill in the art. 

[0151] The fucosyltransferase reaction can be carried out using an oligosaccharide or
polysaccharide as an acceptor molecule. Suitable acceptor substrates used by the H. pylori
fucosyltransferase proteins and methods of the invention include, but are not limited to,
polysaccharides, oligosaccharides, lipids, and glycolipids. For example, the oligosaccharide
LNnT can be fucosylated to form LNFIII. The fucosyltransferases described herein can also
be used in multienzyme systems to produce a desired product from a convenient starting
material. For example, LNFIII was prepared on a multigram scale from lactose using the H.
pylori α-1,3/4-fucosyltransferases from strain 1182 described herein, in combination with
Neisseria gonococcus β-1,3-N-acetylglucosaminyltransferase (lgtA) and Neisseria
gonococcus β-1,4-galactosyltransferase (lgtB).

[0152] The recombinant fucosyltransferase fusion protein used in the methods of the
invention is chosen based upon its ability to fucosylate the fucosyltransferase acceptor
substrates of interest. Preferably, the fucosyltransferase is assayed for suitability using a
fucosyltransferase acceptor substrate that is attached to a soluble saccharide or
oligosaccharide. The use of a soluble saccharide or oligosaccharide acceptor substrate in the
assay to determine fucosyltransferase activity allows one to select a fucosyltransferase that
produces the desired oligosaccharide product.

[0153] The fucosyltransferase reaction can be carried out using a lipid or glycolipid as an
acceptor molecule. Many saccharides require the presence of particular fucosylated
structures in order to exhibit biological activity. Intercellular recognition mechanisms often
require a fucosylated oligosaccharide. For example, a number of proteins that function as cell
adhesion molecules, including P-selectin and E-selectin, bind specific cell surface fucosylated
carbohydrate structures, for example, the sialyl Lewis x and the sialyl Lewis a structures. In
addition, the specific carbohydrate structures that form the ABO blood group system are
fucosylated. The carbohydrate structures in each of the three groups share a Fucα1,2Galβ1-
disaccharide unit. In blood group O structures, this disaccharide is the terminal structure.
The group A structure is formed by an α1,3 GalNAc transferase that adds a terminal GalNAc
residue to the disaccharide. The group B structure is formed by an α1,3 galactosyltransferase
that adds a terminal galactose residue. The Lewis blood group structures are also fucosylated.
For example, the Lewis x and Lewis a structures are Galβ1,4(Fucα1,3)GlcNAc and
Galβ1,3(Fucα1,4)GlcNAc, respectively. Both these structures can be further sialylated
(NeuAcα2,3-) to form the corresponding sialylated structures. Other Lewis blood group
structures of interest are the Lewis y and b structures, which are
Fucα1,2Galβ1,4(Fucα1,3)GlcNAcβ-OR and Fucα1,2Galβ1,3(Fucα1,4)GlcNAc-OR,
respectively. For a description of the structures of the ABO and Lewis blood group structures
and the enzymes involved in their synthesis see Essentials of Glycobiology, Varki et al. eds.,
Chapter 16 (Cold Spring Harbor Press, Cold Spring Harbor, NY, 1999).

[0154] The recombinant fucosyltransferase fusion protein used in the methods of the
invention is chosen based upon its ability to fucosylate the fucosyltransferase acceptor
substrates of interest. Preferably, the fucosyltransferase is assayed for suitability using a
fucosyltransferase acceptor substrate that is attached to a lipid or glycolipid. The use of a
glycolipid-linked acceptor substrate, rather than an acceptor substrate that is part of a soluble
oligosaccharide, in the assay to determine fucosyltransferase activity allows one to select a
fucosyltransferase that produces the selected fucosylation pattern on the glycolipid.

[0155] Fucosyltransferases have been used in synthetic pathways to transfer a fucose unit
from guanosine-5'-diphosphofucose to a specific hydroxyl of a saccharide acceptor. For
example, Ichikawa prepared sialyl Lewis-X by a method that involves the fucosylation of
sialylated lactosamine with a cloned fucosyltransferase (Ichikawa et al., J. Am. Chem. Soc.
114: 9283-9298 (1992)). Lowe has described a method for expressing non-native
fucosylation activity in cells, thereby producing fucosylated glycoproteins, cell surfaces, etc.
(U.S. Patent No. 5,955,347).

[0156] In one embodiment, the methods of the invention are practiced by contacting a 
substrate, having an acceptor moiety for a fucosyltransferase, with a reaction mixture that
includes a fucose donor moiety, a fucosyltransferase, and other reagents required for
fucosyltransferase activity. The substrate is incubated in the reaction mixture for a sufficient
time and under appropriate conditions to transfer fucose from the fucose donor moiety to the
fucosyltransferase acceptor moiety. In preferred embodiments, the fucosyltransferase
catalyzes the fucosylation of at least 60% of the respective fucosyltransferase acceptor
moieties in the composition. 

[0157] Specificity for a selected substrate is only the first criterion a preferred
fucosyltransferase should satisfy. The fucosyltransferase used in the method of the invention
is preferably also able to efficiently fucosylate a variety of substrates, and support scale-up of
the reaction to allow the fucosylation of at least about 500 mg of the substrate. More
preferably, the fucosyltransferase will support the scale of the fucosylation reaction to allow
the synthesis of at least about 1 kg, and more preferably, at least 10 kg of substrate with
relatively low cost and infrastructure requirements.

[0158] Suitable acceptor moieties for fucosyltransferase-catalyzed attachment of a fucose
residue include, but are not limited to, GlcNAc-OR, Galβ1,3GlcNAc-OR,
NeuAcα2,3Galβ1,3GlcNAc-OR, Galβ1,4GlcNAc-OR and NeuAcα2,3Galβ1,4GlcNAc-OR,
where R is an amino acid, a saccharide, an oligosaccharide or an aglycon group having at
least one carbon atom. R is linked to or is part of a substrate. The appropriate
fucosyltransferase for a particular reaction is chosen based on the type of fucose linkage that
is desired (e.g., α2, α3, or α4), the particular acceptor of interest, and the ability of the
fucosyltransferase to achieve the desired high yield of fucosylation. Suitable
fucosyltransferases and their properties are described above.

[0159] If a sufficient proportion of the substrate-linked oligosaccharides in a composition 
does not include a fucosyltransferase acceptor moiety, one can synthesize a suitable acceptor. 
For example, one preferred method for synthesizing an acceptor for a fucosyltransferase 
involves use of a GlcNAc transferase to attach a GlcNAc residue to a GlcNAc transferase 
acceptor moiety, which is present on the substrate-linked oligosaccharides. In preferred
embodiments a transferase is chosen, having the ability to glycosylate a large fraction of the
potential acceptor moieties of interest. The resulting GlcNAcβ-OR can then be used as an
acceptor for a fucosyltransferase.

[0160] The resulting GlcNAcβ-OR moiety can be galactosylated prior to the
fucosyltransferase reaction, yielding, for example, a Galβ1,3GlcNAc-OR or
Galβ1,4GlcNAc-OR residue. In some embodiments, the galactosylation and fucosylation steps can
be carried out simultaneously. By choosing a fucosyltransferase that requires the
galactosylated acceptor, only the desired product is formed. Thus, this method involves:

(a) galactosylating a compound of the formula GlcNAcβ-OR with a
galactosyltransferase in the presence of UDP-galactose under conditions sufficient to form
the compounds Galβ1,4GlcNAcβ-OR or Galβ1,3GlcNAc-OR; and

(b) fucosylating the compound formed in (a) using a fucosyltransferase in
the presence of GDP-fucose under conditions sufficient to form a compound selected from:

Fucα1,2Galβ1,4GlcNAc1β-O1R;

Fucα1,2Galβ1,3GlcNAc-OR;

Fucα1,2Galβ1,4GalNAc1β-O1R;

Fucα1,2Galβ1,3GalNAc-OR;

Galβ1,4(Fucα1,3)GlcNAcβ-OR; or

Galβ1,3(Fucα1,4)GlcNAc-OR.

[0161] One can add additional fucose residues to the above structures by including an
additional fucosyltransferase, which has the desired activity. For example, the methods can
form oligosaccharide determinants such as Fucα1,2Galβ1,4(Fucα1,3)GlcNAcβ-OR and
Fucα1,2Galβ1,3(Fucα1,4)GlcNAc-OR. Thus, in another preferred embodiment, the method
includes the use of at least two fucosyltransferases. The multiple fucosyltransferases are used
either simultaneously or sequentially. When the fucosyltransferases are used sequentially, it
is generally preferred that the glycoprotein is not purified between the multiple fucosylation
steps. When the multiple fucosyltransferases are used simultaneously, the enzymatic activity
can be derived from two separate enzymes or, alternatively, from a single enzyme having
more than one fucosyltransferase activity.

[0162] The fucosyltransferase reaction can be carried out by contacting recombinant
fucosyltransferase protein of the present invention with a mixture that includes, for example,
multiple copies of a glycoprotein species, a majority of which preferably have one or more
linked oligosaccharide groups that include an acceptor substrate for a fucosyltransferase;
fucose donor substrate; and other reagents required for fucosyltransferase activity. The
glycoprotein is incubated in the reaction mixture for a sufficient time and under appropriate
conditions to transfer fucose from a donor substrate to a fucosyltransferase acceptor substrate.

[0163] The recombinant fucosyltransferase fusion protein used in the methods of the
invention is chosen based upon its ability to fucosylate the fucosyltransferase acceptor
substrates of interest. Preferably, the fucosyltransferase is assayed for suitability using a
fucosyltransferase acceptor substrate that is attached to a glycoprotein. The use of a
glycoprotein-linked acceptor substrate, rather than an acceptor substrate that is part of a
soluble oligosaccharide, in the assay to determine fucosyltransferase activity allows one to
select a fucosyltransferase that produces the selected fucosylation pattern on the glycoprotein.

[0164] In a preferred embodiment, the recombinant fucosyltransferase fusion protein of the
present invention has a high level of expression in cells and/or high enzymatic activity (e.g.,
high specificity for a selected substrate and/or high catalytic activity). In another preferred
embodiment, the fucosyltransferase is useful in a method for fucosylating a commercially
important recombinant or transgenic glycoprotein. The fucosyltransferase used in the method
of the invention is preferably also able to efficiently fucosylate a variety of glycoproteins, and
support scale-up of the reaction to allow the fucosylation of at least about 500 mg of the
glycoprotein. More preferably, the fucosyltransferase will support the scale of the
fucosylation reaction to allow the synthesis of at least about 1 kg, and more preferably, at
least 10 kg of recombinant glycoprotein with relatively low cost and infrastructure
requirements.

[0165] In an exemplary embodiment, the method of the invention results in the formation
on a glycoprotein of at least one ligand for a selectin. Confirmation of the formation of the
ligand is assayed in an operational manner by probing the ability of the glycoprotein to
interact with a selectin. The interaction between a glycoprotein and a specific selectin is
measurable by methods familiar to those in the art (see, for example, Jutila et al., J.
Immunol. 153: 3917-28 (1994); Edwards et al., Cytometry 43(3): 211-6 (2001); Stahn et al.,
Glycobiology 8: 311-319 (1998); Luo et al., J. Cell Biochem. 80(4): 522-31 (2001); Dong et
al., J. Biomech. 33(1): 35-43 (2000); Jung et al., J. Immunol. 162(11): 6755-62 (1999);
Keramidaris et al., J. Allergy Clin. Immunol. 107(4): 734-8 (2001); Fieger et al., Biochim.
Biophys. Acta 1524(1): 75-85 (2001); Bruehl et al., J. Biol. Chem. 275(42): 32642-8 (2000);
Tangemann et al., J. Exp. Med. 190(7): 935-42 (1999); Scalia et al., Circ. Res. 84(1): 93-102
(1999); Alon et al., J. Cell Biol. 138(5): 1169-80 (1997); Steegmaier et al., Eur. J. Immunol.
27(6): 1339-45 (1997); Stewart et al., J. Med. Chem. 44(6): 988-1002 (2001); Schurmann et
al., Gut 36(3): 411-8 (1995); Burrows et al., J. Clin. Pathol. 47(10): 939-44 (1994)).

[0166] Suitable acceptor substrates for fucosyltransferase-catalyzed attachment of a fucose
residue include, but are not limited to, GlcNAc-OR, Galβ1,3GlcNAc-OR,
NeuAcα2,3Galβ1,3GlcNAc-OR, Galβ1,4GlcNAc-OR and NeuAcα2,3Galβ1,4GlcNAc-OR,
where R is an amino acid, a saccharide, an oligosaccharide or an aglycon group having at
least one carbon atom. R is linked to or is part of a glycoprotein. The appropriate
fucosyltransferase for a particular reaction is chosen based on the type of fucose linkage that
is desired (e.g., α2, α3, or α4), the particular acceptor of interest, and the ability of the
fucosyltransferase to achieve the desired high yield of fucosylation. Suitable
fucosyltransferases and their properties are described above.

[0167] If a sufficient proportion of the glycoprotein-linked oligosaccharides in a
composition does not include a fucosyltransferase acceptor substrate, one can synthesize a
suitable acceptor. For example, one preferred method for synthesizing an acceptor for a
fucosyltransferase involves use of a GlcNAc transferase to attach a GlcNAc residue to a
GlcNAc transferase acceptor substrate, which is present on the glycoprotein-linked
oligosaccharides. In preferred embodiments, a transferase is chosen that has the ability to
glycosylate a large fraction of the potential acceptor substrates of interest. The resulting
GlcNAcβ-OR can then be used as an acceptor for a fucosyltransferase.

[0168] The resulting GlcNAcβ-OR moiety can be galactosylated prior to the
fucosyltransferase reaction, yielding, for example, a Galβ1,3GlcNAc-OR or
Galβ1,4GlcNAc-OR residue. In some embodiments, the galactosylation and fucosylation steps
are carried out simultaneously. Thus, this method involves:

(a) galactosylating a compound of the formula GlcNAcβ-OR with a
galactosyltransferase in the presence of a UDP-galactose under conditions sufficient to form
the compounds Galβ1,4GlcNAcβ-OR or Galβ1,3GlcNAc-OR; and

(b) fucosylating the compound formed in (a) using a fucosyltransferase in
the presence of GDP-fucose under conditions sufficient to form a compound selected from:

Fucα1,2Galβ1,4GlcNAc1β-O1R;

Fucα1,2Galβ1,3GlcNAc-OR;

Fucα1,2Galβ1,4GalNAc1β-O1R;

Fucα1,2Galβ1,3GalNAc-OR;

Galβ1,4(Fucα1,3)GlcNAcβ-OR; or

Galβ1,3(Fucα1,4)GlcNAc-OR.

[0169] One can add additional fucose residues to a fucosylated glycoprotein by treating the
fucosylated peptide with a fucosyltransferase that has the desired activity. For example,
the methods can form oligosaccharide determinants such as
Fucα1,2Galβ1,4(Fucα1,3)GlcNAcβ-OR and Fucα1,2Galβ1,3(Fucα1,4)GlcNAc-OR. Thus,
in another preferred embodiment, the method includes the use of at least two
fucosyltransferases. The multiple fucosyltransferases are used either simultaneously or
sequentially. When the fucosyltransferases are used sequentially, it is generally preferred that
the glycoprotein is not purified between the multiple fucosylation steps. When the multiple
fucosyltransferases are used simultaneously, the enzymatic activity can be derived from two
separate enzymes or, alternatively, from a single enzyme having more than one
fucosyltransferase activity.

H. Multiple-enzyme oligosaccharide synthesis 
[0170] As discussed above, in some embodiments, two or more enzymes may be used to 
form a desired oligosaccharide or oligosaccharide determinant on a glycoprotein or 
glycolipid. For example, a particular oligosaccharide determinant might require addition of a 
galactose, a sialic acid, and a fucose in order to exhibit a desired activity. Accordingly, the 
invention provides methods in which two or more enzymes, e.g., glycosyltransferases,
trans-sialidases, or sulfotransferases, are used to obtain high-yield synthesis of a desired
oligosaccharide determinant.

[0171] In one preferred embodiment, LNFIII was prepared from lactose using the H. pylori
α-1,3/4-fucosyltransferases from strain 1182 described herein, in combination with Neisseria
gonococcus β-1,3-N-acetylglucosaminyltransferase (lgtA) and Neisseria gonococcus
β-1,4-galactosyltransferase (lgtB). Those of skill will recognize that other
β-1,3-N-acetylglucosaminyltransferase and β-1,4-galactosyltransferase enzymes can be used
in this embodiment of the invention.

[0172] In some cases, a glycoprotein- or glycolipid-linked oligosaccharide will include an
acceptor substrate for the particular glycosyltransferase of interest upon in vivo biosynthesis
of the glycoprotein or glycolipid. Such glycoproteins or glycolipids can be glycosylated
using the H. pylori fucosyltransferase proteins and methods of the invention without prior
modification of the glycosylation pattern of the glycoprotein or glycolipid, respectively. In
other cases, however, a glycoprotein or glycolipid of interest will lack a suitable acceptor
substrate. In such cases, the methods of the invention can be used to alter the glycosylation
pattern of the glycoprotein or glycolipid so that the glycoprotein- or glycolipid-linked
oligosaccharides then include an acceptor substrate for the glycosyltransferase-catalyzed
attachment of a preselected saccharide unit of interest to form a desired oligosaccharide
moiety.

[0173] Glycoprotein- or glycolipid-linked oligosaccharides optionally can be first
"trimmed," either in whole or in part, to expose either an acceptor substrate for the
glycosyltransferase or a moiety to which one or more appropriate residues can be added to
obtain a suitable acceptor substrate. Enzymes such as glycosyltransferases and
endoglycosidases are useful for the attaching and trimming reactions. For example, a
glycoprotein that displays "high mannose"-type oligosaccharides can be subjected to
trimming by a mannosidase to obtain an acceptor substrate that, upon attachment of one or
more preselected saccharide units, forms the desired oligosaccharide determinant.

[0174] The methods are also useful for synthesizing a desired oligosaccharide moiety on a
protein or lipid that is unglycosylated in its native form. A suitable acceptor substrate for the
corresponding glycosyltransferase can be attached to such proteins or lipids prior to
glycosylation using the methods of the present invention. See, e.g., US Patent No. 5,272,066
for methods of obtaining polypeptides having suitable acceptors for glycosylation.

[0175] Thus, in some embodiments, the invention provides methods for in vitro sialylation
of saccharide groups present on a glycoconjugate that first involves modifying the
glycoconjugate to create a suitable acceptor. Examples of preferred methods of
multi-enzyme synthesis of desired oligosaccharide moieties are as follows.

Fucosylated and sialylated oligosaccharide moieties

[0176] Oligosaccharide determinants that confer a desired biological activity upon a 
glycoprotein often are sialylated in addition to being fucosylated. Accordingly, the invention 
provides methods in which a glycoprotein-linked oligosaccharide is sialylated and 
fucosylated in high yields. 

[0177] The sialylation can be accomplished using either a trans-sialidase or a
sialyltransferase, except where a particular moiety requires an α2,6-linked sialic acid, in
which case a sialyltransferase is used. Suitable examples of each type of enzyme are described
above. These methods involve sialylating an acceptor for a sialyltransferase or a
trans-sialidase by contacting the acceptor with the appropriate enzyme in the presence of an
appropriate donor substrate. For sialyltransferases, CMP-sialic acid is a preferred donor
substrate. Trans-sialidases, however, preferably use a donor substrate that includes a leaving
group to which the trans-sialidase cannot add sialic acid.

[0178] Acceptor substrates of interest include, for example, Galβ-OR. In some
embodiments, the acceptor substrates are contacted with a sialyltransferase in the presence of
CMP-sialic acid under conditions in which sialic acid is transferred to the non-reducing end
of the acceptor substrate to form the compound NeuAcα2,3Galβ-OR or NeuAcα2,6Galβ-OR.
In this formula, R is an amino acid, a saccharide, an oligosaccharide or an aglycon group
having at least one carbon atom. R is linked to or is part of a glycoprotein. An
α2,8-sialyltransferase can also be used to attach a second or multiple sialic acid residues to the
above structures.

[0179] To obtain an oligosaccharide moiety that is both sialylated and fucosylated, the
sialylated acceptor is contacted with a fucosyltransferase as discussed above. The
sialyltransferase and fucosyltransferase reactions are generally conducted sequentially, since
most sialyltransferases are not active on a fucosylated acceptor. FT VII, however, acts only
on a sialylated acceptor substrate. Therefore, FT VII can be used in a simultaneous reaction
with a sialyltransferase.

[0180] If the trans-sialidase is used to accomplish the sialylation, the fucosylation and
sialylation reactions can be conducted either simultaneously or sequentially, in either order.
The protein to be modified is incubated with a reaction mixture that contains a suitable
amount of a trans-sialidase, a suitable sialic acid donor substrate, a fucosyltransferase
(capable of making an α1,3 or α1,4 linkage), and a suitable fucosyl donor substrate (e.g.,
GDP-fucose).
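
The ordering rules stated in paragraphs [0179] and [0180] can be summarized as a small lookup. The following Python sketch is illustrative only; the names `SialylationEnzyme` and `allowed_reaction_modes` are hypothetical and are not part of the disclosure, and the function encodes only the general guidance given above, not the behavior of any particular enzyme preparation.

```python
from enum import Enum


class SialylationEnzyme(Enum):
    """Enzyme class used for the sialylation step (hypothetical helper type)."""
    SIALYLTRANSFERASE = "sialyltransferase"
    TRANS_SIALIDASE = "trans-sialidase"


def allowed_reaction_modes(enzyme, ft_requires_sialylated_acceptor=False):
    """Return the reaction orderings the text describes as workable.

    ft_requires_sialylated_acceptor should be True for a fucosyltransferase
    that acts only on a sialylated acceptor (the text names FT VII).
    """
    if enzyme is SialylationEnzyme.TRANS_SIALIDASE:
        # [0180]: simultaneous or sequential, in either order.
        return [
            "simultaneous",
            "sialylation then fucosylation",
            "fucosylation then sialylation",
        ]
    if ft_requires_sialylated_acceptor:
        # [0179]: such an enzyme can share a reaction with a sialyltransferase.
        return ["simultaneous", "sialylation then fucosylation"]
    # [0179]: most sialyltransferases are not active on a fucosylated acceptor,
    # so sialylation generally comes first.
    return ["sialylation then fucosylation"]
```

For example, `allowed_reaction_modes(SialylationEnzyme.SIALYLTRANSFERASE, True)` lists the simultaneous option, whereas the default call for a sialyltransferase returns only the sequential ordering.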

Galactosylated, fucosylated and sialylated oligosaccharide determinants 
[0181] The invention also provides methods for enzymatically synthesizing oligosaccharide 
moieties that are galactosylated, fucosylated, and sialylated. Either a sialyltransferase or a 
trans-sialidase (for α2,3-linked sialic acid only) can be used in these methods.

[0182] The trans-sialidase reaction involves incubating the protein to be modified with a
reaction mixture that contains a suitable amount of a galactosyltransferase (Galβ1,3 or
Galβ1,4), a suitable galactosyl donor (e.g., UDP-galactose), a trans-sialidase, a suitable sialic
acid donor substrate, a fucosyltransferase (capable of making an α1,3 or α1,4 linkage), a
suitable fucosyl donor substrate (e.g., GDP-fucose), and a divalent metal ion. These reactions
can be carried out either sequentially or simultaneously.

[0183] If a sialyltransferase is used, the method involves incubating the protein to be
modified with a reaction mixture that contains a suitable amount of a galactosyltransferase
(Galβ1,3 or Galβ1,4), a suitable galactosyl donor (e.g., UDP-galactose), a sialyltransferase
(α2,3 or α2,6) and a suitable sialic acid donor substrate (e.g., CMP-sialic acid). The reaction
is allowed to proceed substantially to completion, and then a fucosyltransferase (capable of
making an α1,3 or α1,4 linkage) and a suitable fucosyl donor substrate (e.g., GDP-fucose)
are added. If a fucosyltransferase is used that requires a sialylated substrate (e.g., FT VII),
the reactions can be conducted simultaneously.

Sialyltransferase reactions 
[0184] As discussed above, in some embodiments, the present invention provides H.
pylori fucosyltransferase proteins and methods for fucosylating a glycoprotein following the
sialylation of the glycoprotein. In a preferred embodiment, the fusion proteins and methods
of the invention synthesize glycoproteins having a substantially uniform sialylation pattern.
The sialylated glycoprotein is then fucosylated, thereby producing a population of
fucosylated glycoproteins in which the members have a substantially uniform fucosylation
pattern.

[0185] The glycoprotein can be contacted with a sialyltransferase and a sialic acid donor
substrate for a sufficient time and under appropriate reaction conditions to transfer sialic acid
from the sialic acid donor substrate to the saccharide groups. Sialyltransferases comprise a
family of glycosyltransferases that transfer sialic acid from the donor substrate CMP-sialic
acid to acceptor oligosaccharide substrates. In preferred embodiments, the sialyltransferases
are recombinant sialyltransferase fusion proteins. Suitable sialyltransferase reactions are
described in US Provisional Application No. 60/035,710, filed January 16, 1997 and US
nonprovisional Application No. 09/007,741, filed January 15, 1998.

[0186] In some embodiments, the saccharide moieties on a glycoprotein having sialylation
patterns altered by the H. pylori fucosyltransferase proteins of the present invention have a
greater percentage of terminal galactose residues sialylated than the unaltered glycoprotein.
Preferably, greater than about 80% of terminal galactose residues present on the
glycoprotein-linked oligosaccharides will be sialylated following use of the methods. More
preferably, use of the H. pylori fucosyltransferase proteins and methods of the invention will
result in greater than about 90% sialylation, and even more preferably greater than about 95%
sialylation of terminal galactose residues. Most preferably, essentially 100% of the terminal
galactose residues present on the glycoproteins in the composition are sialylated following
modification using the methods of the present invention. The fusion proteins and methods of
the invention are typically capable of achieving the desired level of sialylation in about 48
hours or less, and more preferably in about 24 hours or less.
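
The preference tiers in paragraph [0186] can be applied to measured counts of terminal galactose residues. The sketch below is a minimal illustration and not part of the disclosed methods; the function name `sialylation_tier` is hypothetical, and the `complete_cutoff` of 0.99 is an assumption standing in for "essentially 100%", which the text does not quantify.

```python
def sialylation_tier(sialylated_gal, terminal_gal, complete_cutoff=0.99):
    """Classify the fraction of terminal galactose residues that are sialylated.

    sialylated_gal and terminal_gal are counts (or amounts in the same unit).
    complete_cutoff is an assumed stand-in for "essentially 100%".
    Returns (fraction, tier label).
    """
    if terminal_gal <= 0:
        raise ValueError("terminal_gal must be positive")
    if not 0 <= sialylated_gal <= terminal_gal:
        raise ValueError("sialylated_gal must lie between 0 and terminal_gal")

    fraction = sialylated_gal / terminal_gal
    if fraction >= complete_cutoff:
        tier = "most preferred"
    elif fraction > 0.95:
        tier = "even more preferred"
    elif fraction > 0.90:
        tier = "more preferred"
    elif fraction > 0.80:
        tier = "preferred"
    else:
        tier = "below preferred range"
    return fraction, tier
```

Because the text qualifies each threshold with "about", the hard cutoffs here should be read as a convenience for bookkeeping rather than as a precise boundary.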

[0187] At least 15 different mammalian sialyltransferases have been documented, and the
cDNAs of thirteen of these have been cloned to date (for the systematic nomenclature that is
used herein, see, Tsuji et al. (1996) Glycobiology 6: v-xiv). These cDNAs can be used for
making the recombinant sialyltransferase fusion proteins of the invention.

[0188] Preferably, for glycosylation of N-linked and/or O-linked carbohydrates of
glycoproteins, the sialyltransferase transfers sialic acid to the terminal sequence Galβ1,4-OR
or GalNAc-OR, where R is an amino acid, a saccharide, an oligosaccharide or an aglycon
group having at least one carbon atom and is linked to or is part of a glycoprotein.
Galβ1,4-GlcNAc is the most common penultimate sequence underlying the terminal sialic acid on
fully sialylated carbohydrate structures. At least three of the cloned mammalian
sialyltransferases meet this acceptor specificity requirement, and each of these has been
demonstrated to transfer sialic acid to N-linked and O-linked carbohydrate groups of
glycoproteins.

[0189] In some embodiments, the invention provides sialylation methods that have increased
commercial practicality through the use of bacterial sialyltransferases, either recombinantly
produced or produced in the native bacterial cells. Two bacterial sialyltransferases have been
recently reported: an ST6Gal II from Photobacterium damsela (Yamamoto et al. (1996) J.
Biochem. 120: 104-110) and an ST3Gal V from Neisseria meningitidis (Gilbert et al. (1996)
J. Biol. Chem. 271: 28271-28276). The two recently described bacterial enzymes transfer
sialic acid to the Galβ1,4GlcNAc sequence on oligosaccharide substrates.

[0190] A recently reported viral α2,3-sialyltransferase is also suitable for testing and
possible use in the sialylation methods of the invention (Sujino et al. (2000) Glycobiology
10: 313-320). This enzyme, v-ST3Gal I, was obtained from Myxoma virus-infected cells
and is apparently related to the mammalian ST3Gal IV as indicated by comparison of the
respective amino acid sequences. v-ST3Gal I catalyzes the sialylation of Type I
(Galβ1,3-GlcNAcβ1-R), Type II (Galβ1,4GlcNAc-β1-R) and III (Galβ1,3GalNAcβ1-R) acceptors.
The enzyme can also transfer sialic acid to fucosylated acceptor substrates (e.g., Lewis-x and
Lewis-a).

[0191] An example of a sialyltransferase that is useful in the claimed methods is ST3Gal
III, which is also referred to as α(2,3)sialyltransferase (EC 2.4.99.6). This enzyme catalyzes
the transfer of sialic acid to the Gal of a Galβ1,3GlcNAc, Galβ1,3GalNAc or
Galβ1,4GlcNAc glycoside (see, e.g., Wen et al. (1992) J. Biol. Chem. 267: 21011; Van den
Eijnden et al. (1991) J. Biol. Chem. 256: 3159). The sialic acid is linked to a Gal with the
formation of an α-linkage between the two saccharides. Bonding (linkage) between the
saccharides is between the 2-position of NeuAc and the 3-position of Gal. This particular
enzyme can be isolated from rat liver (Weinstein et al. (1982) J. Biol. Chem. 257: 13845); the
human cDNA (Sasaki et al. (1993) J. Biol. Chem. 268: 22782-22787; Kitagawa & Paulson
(1994) J. Biol. Chem. 269: 1394-1401) and genomic (Kitagawa et al. (1996) J. Biol. Chem.
271: 931-938) DNA sequences are known, facilitating production of this enzyme by
recombinant expression. In a preferred embodiment, the claimed sialylation methods use a
rat ST3Gal III.

[0192] Other sialyltransferases, including those listed above, are also useful in an economic
and efficient large scale process for sialylation of commercially important glycoproteins. As
described above, a simple test to find out the utility of these other enzymes is to react various
amounts of each enzyme (1-100 mU/mg protein) with a readily available glycoprotein
such as asialo-α1-AGP (at 1-10 mg/ml) to compare the ability of the sialyltransferase of
interest to sialylate glycoproteins. The results can be compared to, for example, either or
both of an ST6Gal I or an ST3Gal III (e.g., a bovine or human enzyme), depending upon the
particular sialic acid linkage that is desired. Alternatively, other glycopeptides or
glycoproteins, or N- or O-linked oligosaccharides enzymatically released from the peptide
backbone can be used in place of asialo-α1-AGP for this evaluation, or one can use
saccharides that are produced by other methods or purified from natural products such as
milk. Preferably, however, the sialyltransferases are assayed using an oligosaccharide that is
linked to a glycoprotein. Sialyltransferases showing an ability to, for example, sialylate
N-linked or O-linked oligosaccharides of glycoproteins more efficiently than ST6Gal I are
useful in a practical large scale process for glycoprotein sialylation.
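
The screening test in paragraph [0192] fixes the enzyme dose per milligram of acceptor glycoprotein (1-100 mU/mg) and the acceptor concentration (1-10 mg/ml), so the amount of enzyme per reaction follows by multiplication. The Python sketch below shows that arithmetic; the function names and the 0.1 ml reaction volume in the usage note are hypothetical choices made for illustration.

```python
import math


def enzyme_milliunits(mu_per_mg_protein, protein_mg_per_ml, volume_ml):
    """Enzyme (in mU) needed for one screening reaction.

    mU = (mU enzyme / mg acceptor protein) * (mg protein / ml) * (ml reaction)
    """
    for name, value in (
        ("mu_per_mg_protein", mu_per_mg_protein),
        ("protein_mg_per_ml", protein_mg_per_ml),
        ("volume_ml", volume_ml),
    ):
        if value <= 0 or not math.isfinite(value):
            raise ValueError(f"{name} must be a positive, finite number")
    return mu_per_mg_protein * protein_mg_per_ml * volume_ml


def screening_grid(doses_mu_per_mg, protein_mg_per_ml, volume_ml):
    """Map each enzyme dose (mU/mg protein) to the mU required per reaction."""
    return {
        dose: enzyme_milliunits(dose, protein_mg_per_ml, volume_ml)
        for dose in doses_mu_per_mg
    }
```

As a usage example, `screening_grid([1, 10, 100], protein_mg_per_ml=5, volume_ml=0.1)` spans the dose range given in the text at an acceptor concentration inside the stated 1-10 mg/ml window.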

[0193] The invention also provides methods of altering the sialylation pattern of a
glycoprotein prior to fucosylation by adding sialic acid in an α2,6Gal linkage as well as the
α2,3Gal linkage, both of which are found on N-linked oligosaccharides of human plasma
glycoproteins. In this embodiment, ST3Gal III and ST6Gal I sialyltransferases are both
present in the reaction and provide proteins having a reproducible ratio of the two linkages
formed in the resialylation reaction. Thus, a mixture of the two enzymes may be of value if
both linkages are desired in the final product.

[0194] An acceptor substrate for the sialyltransferase is present on the glycoprotein to be
modified by the sialylation methods described herein. Suitable acceptors include, for
example, galactosylated acceptors such as Galβ1,4GlcNAc, Galβ1,4GalNAc,
Galβ1,3GalNAc, Galβ1,3GlcNAc, Galβ1,3Ara, Galβ1,6GlcNAc, Galβ1,4Glc (lactose),
GalNAc-O-Ser, GalNAc-O-Thr, and other acceptors known to those of skill in the art (see,
e.g., Paulson et al. (1978) J. Biol. Chem. 253: 5617-5624). Typically, the acceptors are
included in oligosaccharide chains that are attached to asparagine, serine, or threonine
residues present in a protein.

EXAMPLES

[0195] The following examples are offered to illustrate, but not to limit the claimed
invention.

Example 1: Cloning of Helicobacter pylori fucosyltransferases.
[0196] Putative fucosyltransferase genes from the following strains of Helicobacter pylori
were PCR amplified and cloned into vectors for expression in E. coli: strain 915 FutA, strain
1111 FutA, strain 19C2 FutB, strain 1182 FutB, strain 19C2 FutA, strain 26695 FutA, and
strain 1218 FutB. Nucleic acid and amino acid sequences are provided in Figures 1-7. An
amino acid sequence alignment is provided in Figure 12; a nucleic acid sequence alignment is
provided in Figure 13.

[0197] The putative fucosyltransferase proteins were screened for α1,3/4-fucosyltransferase
activity using LNnT and GDP-fucose substrates. The oligosaccharide structures of LNnT and
one product, LNFPIII, are shown in Figure 14.

[0198] One hundred milliliter cultures of E. coli transformed with H. pylori
fucosyltransferase were grown to an OD600 of 0.8, induced with IPTG, and harvested. Cell
lysates were made using a French press. The fucosyltransferase enzymes were tested for
enzymatic activity and acceptor specificity using the substrate LNnT. The reactions
contained 3 mM GDP-fucose, 3 mM LNnT, 50 mM Tris pH 7.5, 20 mM MnCl2, and 15%
bacterial lysate. Reactions were incubated at 37°C for twenty-four hours.
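
The final concentrations in paragraph [0198] can be converted into pipetting volumes with the dilution relation C1·V1 = C2·V2. The Python sketch below is a worked illustration only: the final concentrations and the 15% lysate fraction come from the text, while the stock concentrations and the 100 µl reaction volume in the usage note are hypothetical values, and the function name `reaction_volumes` is not part of the disclosure. The sketch also assumes the 15% figure is volume per volume, as stated for Example 3.

```python
# Final concentrations (mM) stated for the screening reactions.
FINAL_MM = {
    "GDP-fucose": 3.0,
    "LNnT": 3.0,
    "Tris pH 7.5": 50.0,
    "MnCl2": 20.0,
}
LYSATE_FRACTION = 0.15  # 15% bacterial lysate, assumed to be v/v


def reaction_volumes(total_ul, stock_mm):
    """Return the volume (in microliters) of each component for one reaction.

    stock_mm maps each component name in FINAL_MM to its stock
    concentration in mM (assumed values supplied by the user).
    """
    volumes = {}
    for name, final in FINAL_MM.items():
        stock = stock_mm[name]
        if stock < final:
            raise ValueError(f"stock of {name} is below the final concentration")
        # C1 * V1 = C2 * V2, solved for V1.
        volumes[name] = total_ul * final / stock
    volumes["lysate"] = total_ul * LYSATE_FRACTION
    water = total_ul - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute to fit in the reaction volume")
    volumes["water"] = water
    return volumes
```

With assumed stocks of 50 mM GDP-fucose, 50 mM LNnT, 1 M Tris and 1 M MnCl2, a 100 µl reaction would take 6 µl of each sugar substrate, 5 µl of Tris, 2 µl of MnCl2, 15 µl of lysate, and water to volume.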

[0199] Reaction products were separated using the following TLC buffer system: 7 IPA:2
H2O:1 acetic acid. The samples were methylated, hydrolyzed, reduced with sodium
borodeuteride, acetylated and analyzed by GC/MS along with samples of LNnT and LNF3.
Results are shown in Figure 15. A Glc vs. GlcNAc value close to 1 favors fucosylation of
GlcNAc. A Glc vs. GlcNAc value close to 0 favors fucosylation of Glc.
Fucosyltransferases from the following H. pylori strains transferred fucose to GlcNAc:
strain 915 FutA, strain 1111 FutA, strain 19C2 FutB, and strain 1182 FutB. The FutA gene
product from H. pylori strain 19C2A transferred fucose to the reducing glucose of the LNnT
acceptor, as did the FutB gene product from H. pylori strain 1218 FutB. A novel FutA gene
product from H. pylori strain 26695 also catalyzed the transfer of fucose to glucose.

Example 2: Production of oligosaccharides using Helicobacter pylori fucosyltransferases.
[0200] One liter cultures of E. coli expressing H. pylori fucosyltransferases were grown,
induced and harvested. The lysates were used to synthesize LNFIII from LNnT. Two
different ion exchange resins were tested for purification of LNFIII. Reaction mixtures were
centrifuged at 5,000 RPM for thirty minutes. Samples were then processed by ultrafiltration
using hollow fiber ultrafiltration membranes with a molecular weight cut off of 10 kD. Ion
exchange chromatography was done using either an MRS NH4HCO3 column, 1 ml resin per
1 ml synthesis (70%), or a Dowex1/Dowex50 column, 2 ml resin per 1 ml synthesis (82%).
Samples were then run on a P2 Size Exclusion column and then lyophilized. Results are shown in
Figure 16. Yields using the Dowex resin approached 50%, while yields from the MRS
NH4HCO3 column approached 70%.

[0201] LNFIII was prepared from lactose using lysates from E. coli cells expressing H.
pylori α-1,3/4-fucosyltransferases from strain 1182 described herein, in combination with
Neisseria gonococcus β-1,3-N-acetylglucosaminyltransferase (lgtA) and Neisseria
gonococcus β-1,4-galactosyltransferase (lgtB) on a multigram scale. Those of skill will
recognize that other β-1,3-N-acetylglucosaminyltransferase and β-1,4-galactosyltransferase
enzymes can be used in this embodiment of the invention.

Example 3: Production of glycoproteins using Helicobacter pylori fucosyltransferases.
[0202] The ability of fucosyltransferase from H. pylori strain 1182B to add fucose to
acceptor molecules on glycoprotein was tested using asialyltransferrin as a substrate. The
1182B fucosyltransferase was produced in E. coli cells as described above. The reactions
were carried out in a buffer containing 50 mM Tris pH 7.5, 20 mM MnCl2, 200 µg
asialyltransferrin, and 5 mM GDP-fucose. Reactions were started by adding 15% v/v of the
bacterial lysate. The reaction was incubated overnight at 37°C. The samples were analyzed
using GC/MS. Results are shown in Figure 17.

[0203] It is understood that the examples and embodiments described herein are for
illustrative purposes only and that various modifications or changes in light thereof will be
suggested to persons skilled in the art and are to be included within the spirit and purview of
this application and scope of the appended claims. All publications, patents, and patent
applications cited herein are hereby incorporated by reference in their entirety for all
purposes.

WHAT IS CLAIMED IS:

1. A method for producing a fucosylated glycoprotein, the method comprising:
contacting a recombinant fucosyltransferase protein with a mixture comprising
a donor substrate comprising a fucose residue, and an acceptor substrate on a glycoprotein,
under conditions where the fucosyltransferase catalyzes the transfer of the fucose residue
from a donor substrate to the acceptor substrate on the glycoprotein, thereby producing a
fucosylated glycoprotein,
wherein the recombinant fucosyltransferase protein comprises a polypeptide
having greater than 90% identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:2, 4, 6, and 8.

2. The method of claim 1, wherein the polypeptide comprises an amino
acid sequence selected from the group consisting of SEQ ID NO:2, 4, 6, and 8.

3. The method of claim 1, wherein the polypeptide comprises SEQ ID
NO:2.

4. The method of claim 1, wherein the polypeptide further comprises an
amino acid tag.

5. The method of claim 1, wherein the method further comprises a step of
purifying the fucosylated glycoprotein.

6. The method of claim 1, wherein the acceptor substrate is a glucose
residue, and wherein the recombinant fucosyltransferase protein comprises a polypeptide
having greater than 90% identity to SEQ ID NO:6.

7. The method of claim 1, wherein the acceptor substrate is an
N-acetylglucosamine residue, and wherein the recombinant fucosyltransferase protein
comprises a polypeptide having greater than 90% identity to an amino acid sequence selected
from the group consisting of SEQ ID NO:2, 4, and 8.

8. The method of claim 1, wherein an acceptor substrate on the
glycoprotein comprises Galβ1-OR, Galβ1,3/4GlcNAc-OR, NeuAcα2,3Galβ1,3/4GlcNAc-OR,
wherein R is an amino acid, a saccharide, an oligosaccharide, or an aglycon group having at
least one carbon atom.

FIGURE 1

Fucosyltransferase nucleotide sequence from strain 1182 FutB (SEQ ID NO:1)

atgttccaacccctattagacgcttatatagaaagcgcttccattgaaaaaattacctctaaatctcccccccccctaaaaatcgctg 

tggcgaattggtggggagatgaagaggttgaagaatttaaaaagaacattctttattttattctcagtcagcattacacaatcacTO^ 

ccaccaaaaccccaacgaaccctccgatctcgtctttggcagtcctattggatcagccagaaaaatcttatc^ 

aaagagtgttttacaccggtgaaaacgaatcgcctaatttcaacctctttgattacgccataggctttgatgaattggattttagagat 

cgttatttaagaatgcctttatattatgatagactacaccataaagccgagagcgtgaatgacaccacttcgccttacaaactcaaac 

ctgacagcctttatgctttaaaaaaaccctcccatcattttaaagaaaaccaccccaatttalgcgcagtagtgaacaatga 

atcctttgaaaagagggtttgcgagttttgtagcgagcaaccctaacgctcctaaaaggaatgctttctatgacgttttaaattctato 

gagccagttattgggggagggagcgtgaaaaacactttaggctataacattaaaaacaagagcgagtttttaagccaatacaaat 

tcaatctgtgttttgaaaactcacaaggctatggctatgtaactgaaaaaatcattgacgcttactttagccataccatt^ 

ggggagtcctagcgtggcacaagattttaaccctaagagttttgtgaatgtttgtgattttaaagattttgat^ 

gcgatacttgcacacgcacccaaacgcttatttagacatgctttatgaaaaccctttaaacacccttgatgggaaagcttacttttac 

caaaatttgagttttaaaaaaatcctagatttttttaaaacgattttagaaaacgacacgatttatcacgataacccttttattttttatcgt 

gatttgaatgagccgttaatatctattgatgatgatttgagggttaattatgatgatttgagggttaattatgatgatttgagggtt 

tgatgatttgagggttaattatgatgatttgagggttaattatgatgatttgagggttaattatgatgatttgagggttaattatga 

gagggttaattatgatgatttgagggttaattatgatgatttgagggttaattatgatgatttgagggttaattatgagcggctcttaca 

aaacgcctcgcctttattagaactctctcaaaacaccacttttaaaatctatcgcaaagcttatcaaaaatccttacctttgttgcgtg 

ggcgagaaagttgattaaaaaattgggtttgtaa 

Protein sequence from strain 1182 FutB (SEQ ID NO:2)

nifqplldayiesasiekitskspppMavanwwgdeeveefldaulyfilsqhyti 

qnakrvfytgenespnfiilfdyaigfdeldfrdrylrmplyydrlhhkaesvndtt^^ 

cavvmesdplkrgfasfvasnpnapkrnafydvliisiepvigggsvkntlgynil^ 

kiidayfshtipiywgspsvaqdfepksfmvcdfkdfdeaidhvrylhthpnayldmlyen^ 

Idffktilendtiyhdnpfifydlneplisidddlrvnyddlrvnyddlivnydd^ 

Irvnyddlrvnyddlrvnyddlrvnyerliqnaspllelsqnttfldyrkayqksl^ 



FIGURE 2
Fucosyltransferase from strain 1111 FutA
Nucleotide coding sequence (SEQ ID NO:3)

atgttccaacccctattagatgcctttatagaaagcgctccattgaaaaaatggcctctaaatctccccccccta 

cgaattggtggggagatgaagaaattaaaaaatttaaaaagagcgttctttattttatcctaagccagcattaca 

ccgaaaccctgataaacctgcggacatcgtctttggtaacccccttggatcagccagaaaaatcttatcctatcaaaacgcaaaaa 

gggtgttttacaccggtgaaaatgaagtccctaacttcaacctcmgattacgccataggcmgatgaattggac 

tamgagaatgccmgtattatgcctatttgcattataaagccgagcttgttaatgacaccacttcgccttataaactccaacct^^ 

gcctttatgctttaaaaaaaccctcccatcattttaaagaaaaccaccccaatttgtgcgcagtagtgaat^ 

aaaagagggtttgcgagcmgtcgcaagcaaccctaacgctcctagaaggaacgctttttatgaggctttaaacgctattgagcc 

agttgctgggggagggagcgtgaaaaacactttaggctataatgtcaaaaacaagagcgagtttttaagccaatacaaattcaat 

ctgtgtmgaaaacactcaaggctatggctatgtaactgaaaagatcaLttgacgcttatttcagccatacc^^ 

agtcccagcgtggcgaaagattttaaccctaagagttttgtgaatgtccatgatttcaacaacWgatgM^ 

gatacttgcacacgcacccaaacgcttatttagacatgcactatgaaaaccctttaaacactattgatgggaaagcttactttt 

aaatttgagttttaaaaaaatcctagatttttttaaaacgattttagaaaacgacacgatctatca^^ 

amgaatgagccttcagtatctattgatggmgagggttaattatgatgamgagggttaattatgatgaW^ 

gatKgagggttaattatgagcgccttttacaaaacgcctcgcctttattagaactctctcaaaacaccacttt^^ 

gcttatcaaaaatccttgcctttgttgcgtgccataaggagatgggttaaaaagtaa 



Protein sequence (SEQ ID NO:4)

mfqpUdafiesaplkkwpMpplkiavanwwgdeeik^^ 

qnakrvfytgenevpnfelfdyaigfdeldfrdiylrmplyyaylhykaelv^^ 

Icavvnnesdpllagfasfvasnpnapimafyealnaiepvagggsvkntlgynvkn^ 

ekiidayfshtipiywgspsvakdfiq)ksfvnvhdfiinfdeaidyirylhthpna 

kildffktilendtiyhdnpfifyrdlnepsvsidglrv^yddlivnyddlnTiy 

yqkslpllrairrwvkk* 
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FIGURE 3

Strain 1218 FutB nucleotide sequence (SEQ ID NO:5)



atgttccaacccctattagacgcttatatagaaagcgcttccattgaaaaaattacctctaaatctcccccccccctaaa^ 

tggcgaattggtggggagatgaagaggttgaagaatttaaaaagaacattctttattttattctcagtcagcattacacaatcaccct 

ccaccaaaaccccaacgaaccctccgatctcgtctttggcagtcctattggatcagccagaaaaatcttatcctatcaaaacgcaa 

aaagagtgttttacaccggtgaaaacgaatcgcctaatttcaacctctttgattacgccataggctttgatgaattggattttagagat 

cgttatttaagaatgcctttatattatgatagactacaccataaagccgagagcgtgaatgacaccacttcgccttacaaactcaaac 

ctgacagcctttatgctttaaaaaaaccctcccatcattttaaagaaaaccaccccaatttatgcgcagtagtgaacaatgagagcg 

atcctttgaaaagagggtttgcgagttttgtagcgagcaaccctaacgctcctaaaaggaatgctttctatgacgctttaaattctata 

gagccagttattgggggagggagcgtgaaaaacactttaggctataacattaaaaacaagagcgagtttttaagccaatacaaat 

tcaatctgtgttttgaaaactcacaaggctatggctatgtaactgaaaaaatcattgacgcttactttagccataccattcctatttattg 

ggggagtcctagcgtggcacaagattttaacxxtaagagttttgtgaatgtttgtgattttaaagattttgatgaa^ 

gcgatacttgcacacgcacccaaacgcttatttagacatgctttatgaaaaccctttaaacacccttgatgggaaagcttacttttac 

caaaatttgagttttaaaaaaatcctagatttttttaaaacgatcttagaaaacgacacgatttatcacgataacccttttattttttatcgt 

gatttgaatgagccgttaatatctattgatgatttgagggttaattatgatgatttgagggttaattatgatgatttgagggttaato^ 

tgamgagggttaattatgatgatttgagggttaattatgatgatttgagggttaattatgatgatttgagggttaattatg 

gggttaattatgatgatttgagggttaattgtgatgatttgagggttaattatgatgatttgagggttaattatgagcggctcttacaaa 

acgcctcgcctttattagaactctctcaaaacaccacttttaaaatctatcgcaaagcttatcaaaaatccttacctttgttgcgtgcgg 

cgagaaagttgattaaaaaattgggtttgtaa 



Predicted protein strain 1218 FutB (SEQ ID NO:6)

mfqplldayiesasiekitsksppplkiavanwwgdeeveefkknilyfi

qnaknrfytgenespnJWfdyaigfdeldfrdrylrmpIyydrlhhkaesvndtte^ 

cavvnnesdplkrgfasfvasnpnapkniafydalnsiepvigggsvkntlgym]^^ 

kiidayfshtipiywgspsvaqdfepksfvnvcdfkdfdeaidhvrylhthpnayld^ 

IdflBktilendtiyhdnpfifyrdlnepUsiddlrvnyddlrv^yddlrvnyddlnr^ 

vnyddlrvncddlrvnyddlrvnyerllqnaspllelsqnttMyrkayqkslplkaark^ 
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FIGURE 4 



Fucosyltransferase strain 19C2 FutB nucleotide sequence (SEQ ID NO:7)

atgttccaacccctattagacgcttatatagacagcacccgtttagatgaaaccgattataagcccccattaaatatagc^^ 

aattggtggccmggataaaagagaaagcaaagggmagaaaaaaatttatcttacatttcattttaagtcagcattacac 

tctccac^gaaacQCtgataaacctgcggacatcgtttttggtaacccccttggatcagccagaaaaatcctatcctatca^ 

ctaaaagggtgttttacaccggtgaaaacgaagtccctaatttcaacctcmgattacgccataggctttgatgaatt^^ 

gatcgttatttgagaatgcctttatattatgatagactacaccataaagccgagagcgtgaatgacaccaccgcaccttacaagatt 

aaatctgacagcctttatgctttaaaaaagccctcccatcattttaaagaaaaccacccacatttatgcgcgctaatcaataatgaga 

tcgatcctttgaaaagagggtttgcgagctttgtcgcaagcaaccctaacgcccctataaggaacgctttctatgaggctttaaattc 

tattgagccagttactgggggagggagcgtgagaaacactttaggctataacgtcaaaaacaaaaacgaatttttgagccaatac 

aagttcaatctgtgctttgaaaacactcaaggctatggctatgttactgaaaaaatcattgacgcttacttcagccacaccattcctat 

ttattgggggggagtccctagcgtggcgaaagattttaacccc 



Strain 19C2 FutB protein sequence (SEQ ID NO:8)

mfqplldayidstrldetdjIcppMalanv^ldkreskgjfrkk^ 
qnalo^fytgenevpnMfdyaigfdeldfrdiylrniplyydrlh^ 
Icalinneidplkrgfasfvasnpnapirnafyealnsiepvtgggsvrntlgy^ 
iidayfshtipiywggvpsvakdfiip 
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FIGURE 5

Strain 915 FutA fucosyltransferase nucleotide coding sequence (SEQ ID NO:9)

atggcctctaaatctccccccctaaaaatcgctgtggcgaattggtggggagatgaagaaattaaaaaatttaaaaagagcgttct 

ttattttatcctaagccagcattacacaatcactttacaccgaaaccctgataaacctgcggacatcgtctttggtaaccTC^ 

cagccagaaaaatcttatcctatcaaaacgcaaaaagggtgttttacaccggtgaaaatgaagtccctaacttcaacctctttg^ 
cgccataggcttt 



Protein sequence from Strain 915 FutA (SEQ ID NO:10)

maskspplkiavanwwgdeeikkfkksvlyfilsqhytitlhrn
fnlfdyaigf
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FIGURE 6 

Strain 26695 FutA fucosyltransferase nucleotide coding sequence (SEQ ID NO:11)

atgttccaacccctattagacgcctttatagaaagcgcttccattgaaaaaatggcctctaaatctccccccccccccctaaaaatc 

gctgtggcgaattggtggggagatgaagaaattaaagaatttaaaaagagcgttctttattttatcctaagccaacgctacgcaatc 

accctccaccaaaaccccaatgaattttcagatctagtttttagcaatcctcttggagcggctagaaagattttatcttatcaaaacac 

taaacgagtgttttacaccggtgaaaacgaatcacctaatttcaacctctttgattacgccataggctttgatgaattggattttaatga 

tcgttatttgagaatgcctttgtattatgcccatttgcactataaagccgagcttgttaatgacaccactgcgccctacaaactcaaag 

acaacagcctttatgctttaaaaaaaccctctcatcattttaaagaaaaccaccctaatttgtgcgcagtagtgaatgatgagagcg 

atcttttaaaaagagggtttgccagttttgtagcgagcaacgctaacgctcctatgaggaacgctttttatgacgctctaaattccata 

gagccagttactgggggaggaagtgtgagaaacactttaggctataaggttggaaacaaaagcgagtttttaagccaatacaagt 

tcaatctctgttttgaaaactcgcaaggttatggctatgtaaccgaaaaaatccttgatgcgtattttagccataccattcctatttattg 

ggggagtcccagcgtggcgaaagattttaaccctaaaagttttgtgaatgtgcatgatttcaacaactttgatgaagcgattgattat 

atcaaatacctgcacacgcacccaaacgcttatttagacatgctctatgaaaaccctttaaacacccttgatgggaaagcttactttt 

accaagatttgagttttaaaaaaatcctagatttttttaaaacgattttagaaaacgatacgatttatcacaaattctcaacatctttcatg 

tgggagtacgatctgcataagccgttagtatccattgatgatttgagggttaattatgatgatttgagggttaattatgaccggctttta 

caaaacgcttcgcctttattagaactctctcaaaacaccacttttaaaatctatcgcaaagcttatcaaaaatccttgcctttgttgcgc 
gcggtgagaaagttggttaaaaaattgggtttgtaa 

Protein coding sequence Strain 26695 FutA (SEQ ID NO:12)

mfqplldafiesasiekmaskspppplkiavanwwgdeeikefkksvlyfilsqryaitlhqnpnefsdlvfsnplgaarkil

syqntkrvfytgenespnfnlfdyaigfdeldfndrylrmplyyahlhykaelvndttapyklkdnslyalkkpshhfkenh

pnlcavvndesdllkrgfasfvasnanapmrnafydalnsiepvtgggsvrntlgykvgnkseflsqykfnlcfensqgygy

vtekildayfshtipiywgspsvakdfnpksfvnvhdfnnfdeaidyikylhthpnayldmlyenplntldgkayfyqdlsf

kkildffktilendtiyhkfstsfmweydlhkplvsiddlrvnyddlrvnydrllqnasplle lsqnttfkiyrkayqkslpllrav
rklvkklgl*
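
The 3' portion of the strain 26695 FutA coding sequence contains a 21-nucleotide unit, gatgatttgagggttaattat, that appears back to back; translated in frame it reads DDLRVNY. The sketch below is one way to count such tandem copies. The function name `longest_tandem_run` and the choice of this particular unit are illustrative assumptions, and the fragment is copied from a single line of the SEQ ID NO:11 listing rather than from the full sequence.

```python
# Illustrative sketch: count back-to-back copies of a repeat unit.
# UNIT and FRAGMENT are copied from the SEQ ID NO:11 listing; the helper
# name is hypothetical and not taken from the source document.

UNIT = "gatgatttgagggttaattat"  # 21 nt, translates in frame to DDLRVNY
FRAGMENT = (
    "tgggagtacgatctgcataagccgttagtatccatt"
    "gatgatttgagggttaattatgatgatttgagggttaattat"
    "gaccggctttta"
)


def longest_tandem_run(seq: str, unit: str) -> int:
    """Return the largest number of consecutive copies of unit in seq."""
    best = 0
    start = seq.find(unit)
    while start != -1:
        run = 0
        pos = start
        # Walk forward one unit at a time while copies keep matching.
        while seq.startswith(unit, pos):
            run += 1
            pos += len(unit)
        best = max(best, run)
        start = seq.find(unit, start + 1)
    return best


print(longest_tandem_run(FRAGMENT, UNIT))  # prints 2 for this fragment
```

Running the same function over the strain 1182 and 1218 FutB listings would need a cleaned copy of those sequences, since several of their lines are truncated in this text.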
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FIGURE 7

19C2A fucosyltransferase nucleotide sequence (SEQ ID NO:13)
atgttccaacccttactagacgcctttatagaaagtgctccaatt

19C2A predicted protein sequence (SEQ ID NO:14)
mfqplldafiesapi
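
Each predicted protein sequence in these figures is the in-frame translation of the nucleotide sequence listed above it. As a minimal sketch of that step, the code below translates the short 19C2A sequence (SEQ ID NO:13) with the standard codon table. The names `CODON_TABLE` and `translate` are illustrative, and the code assumes the input holds only a, c, g and t; any other character raises a `KeyError`.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    seq = dna.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# 19C2A nucleotide sequence as listed for SEQ ID NO:13.
SEQ_ID_13 = "atgttccaacccttactagacgcctttatagaaagtgctccaatt"
protein = translate(SEQ_ID_13)
print(protein.lower())  # prints mfqplldafiesapi
```

The same function can be used to cross-check a longer listing line by line, provided the reading frame carried over from the previous line is respected.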
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FIGURE 8

Protein sequence from strain 1182 FutB aligned with pfam00852, Glyco_transf_10, 
Glycosyltransferase family 10 

Query: 23 PPPLKIAVANWWGDEEVEEFKKNILYFILSQHYTITLHQNPNEPSDLVFGS-PIGSARKI 81 
Sbjct: 11 TVPIJiU(aVIWWSI,rEYKEWKKSPrYFIGSQAPQPPIJ^---ILIiWTW 67 

Query: 82 LSYQNAKRVFYTGEN---ESPNFNLF---DYAIGFDELDFRDRYLRMPLYYDRLHHKAES 135 

Sbjct: 68 LSyQNTAROUJTANRSPLESADAVIiFHHRDLSKGFPDLPPSPRPPGQPWVWASMBSPSN^ 127 

Query: 136 -VNiyiTSPyKLKPDSLyAIiKKPSHHFKENHPNLCAVVKNESDPIiKRGFASF^^ 193 
Sbjct: 128 GiaJDLRIXSyFNWTLSYRADSDAFHPyGYI^PRLSQVVNAPIJ^AKra 187 

Query: 194 KKNAFYDVnJNSIEPVIGGGSVKNTLGYNIKNKSEFI^YKPbJI^ 253 
Sbjct: 188 KRERFYKQIjNKHIjQVDVGGRVANPLPIiKVGCLVETIJSQYKFYIAFENSQHYDYW 247 

Query: 254 -AYPSHTIPiyWGSPSVAQDFNP-KSFVNVaJFKDFDEAIDHVRyUfTHPNAYL 305 
Sbjct: 248 NAIiQAGTIPVNOjGPRAVyEDFVPPKSFIHVDDFKSPKEIADyLLYIiDTNPTAYS 301 
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FIGURE 9

Fucosyltransferase from strain 1111 FutA aligned with pfam00852, Glyco_transf_10, 
Glycosyltransferase family 10 

Query: 27 IAVAinWGDEEIKKFKKSVLyFII.SQHYTITLHRNPDKPADIVFG-NPLGSARK:iLSYQN ' 85 

Sbjct: 16 IiAlYTWWSIiIEYKEWKKSPIYPIGSQAPQPPIiR ILIiWTWPFNGNPIiALSDCPLSYQN 72 

Query: 86 AKRVPYTGEN---EVPNPNLF---DYAIGPDEU>FRDRYIiRMPLYYAyiiHyKAEL-VOT 138 

Sbjct: 73 TARCRLTANRSPLESADAVLPHHRDLSKGFPDLPPSPRPPGQPWVWASMESPSNSGIjNDL 132 

Query: 139 TSPYKLQPDSLYALKKPSHHPKENHPWLOlVVimESDPIJCRGPASPVASKPM-AP^^ 197 

Sbjct: 133 RIXSYFNWTLSYRADSDAFHPYGYLEPRLSQVVNTIPLLSAKIUCGJUIWW^^ 192 

Query: 198 YEALNAIEPVAGGGSVKNTIJ3YNVKNKSEFLSQYKFNLCFENTQGYGYWEK1ID-^^ 256 

Sbjct: 193 YKQLtnOJLQVDVGGRVANPLPLBCVGCLVErriiSQYKPYIiAF^^ 252 

Query: 257 HTIPIYVraSPSVAKDENP-KSFVNVHDFNNFDEAIDYIRYIiiTHPNAYIiDMHY^ 315 

Sbjct: 253 GTIPVVIXSPRAVYEDFVPPKSPIHVDDPKSPKELADYIiLiYIiDTjn^ 3 01 

Query: 316 DGKAYFYQOT-SPKiaiDFFKTIIfiNDriYHDNPPIPyroUfEPSVSITO 375 

Sbjct: 302 EYPEWRYDIAVRIiPSWnAIiR 321 

Query: 376 YDDLRVNYDDJ^VNYERU^NASPLI^LSQNTTPKZYRKAYQ 417 

Sbjct: 322 YDEGFCRVCRIiLQNAPD RYKTYPNIAKWFQ 351 






WO 2004/009793 



10/521138 

PCT/US2003/023155 



FIGURE 10 

Protein sequence from strain 1218 FutB aligned with pfam00852, Glyco_transf_10, 
Glycosyltransferase family 10 

Query: 23 PPPLKIAVANWWGDEEVEEFKKNILYFILSQHYTITLHQNPNEPSDLVFGS-PIGSARKI 81 
Sbjct: 11 TVPLLLAIYTWWSLIEYKEWKKSPIYFIGSQAPQPPLR---ILLWTWPFKGNPLALSDCP 67 

Query: 82 LSYQNAKRVFYTGEN---ESPNFNLF---DYAIGFDELDFRDRYLRMPLYYDRLHHKAES 135 
Sbjct: 68 LSYQNTARCRLTANRSPLESADAVLFHHRDLSKGFPDLPPSPRPPGQPWVWASMESPSNS 127 

[The remaining rows (Query: 136, 194, 254, 312, 372; Sbjct: 128, 188, 248, 324) are not legible in the scan; only their row labels and the row below survive.] 

Sbjct: 302 -EYFEWRYDIiRVRLFSWDALR- - YD 323 
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FIGURE 11 



Protein sequence from strain 19C2 FutB aligned with pfam00852, Glyco_transf_10, 
Glycosyltransferase family 10 

Query: 22 PPUJIAlJVNWWPiaJKRESKGFRKKFIIJIFII^QHYTIAIiHRNPDKPADIVPG-NPI^ 80 
Sbjct: 12 VPIiliLAIYTWWSL- - IBYKEW-KKSPIYPIGSQAPQPPLR ILLWTWPFMGNPIALSD 65 

Query: 81 KIIiSYQNAKRVFYTCBN EVPNFNLF DYAIGFDELDFRDRYIiRMPLYYDRiaiHKA 134 
Sbjct: 66 CPIjSYQNTARCRIiTANRSPIiESADAVLFHHRDLSKGFPDIiPPSPRPPGQPWVWASMESPS 125 

Query: 135 ES-VMnrTAPYKIKSDSIiYAIiKKPSHHFKENHPHIiCALINNEIDPIJCRGFASFVASNPN- 192 
Sbjct: 126 NSGIiNDUUXSYFNWIliSYRADSDAFHPyGYLEPRLSQVVNAPIJJSAKRKG 185 

Query: 193 APIRNAFYEALNSIEPVTGGGSVRNTLGYNVKNKNEFLSQYKPNI^CFENTQGYGYVTEKI 252 
Sbjct: 186 I^KRERFYKQIjNKHLQVDVGGRVANPIjPIjKVGCIjVETriSQYKFYIA^^ 245 

Query: 253 ID-AYFSHTIPIYWGGVPSVAKDFNP 277 
Sbjct: 246 HKNALQAGTIFWIiGP-RAVYEDFVP 270 
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FIGURE 12 



[Multiple sequence alignment of the predicted protein sequences 1111FutA.pep, 19C2A.pep, 915A.pepneose, 26695A.pep, 1182B.pep, 1218B.pep and ORF19C2B.pep, with a consensus row, covering alignment positions 1 to 250. The individual rows are not legible in the scan. The consensus rows survive as follows.] 

Consensus (1) MFQPLLDAFIESA ZEK SK PPLKXAVANWWGDEE I FKK I 
Consensus (51) LYFiLSQHYTITLH MP PADIVFGNPLGSARiaLSYQN^^ 
Consensus (101) E PNFNLFDYAIGFDELDFRDRYLRMPLYY LHHKAE VNDTTSPYKLK 
Consensus (151) DSLYALKKPSIfflFKEiraPNIjCAVVl^ RN 
Consensus (201) AFYDAIiNSXEPV (3GGSVKNTLGYNVKNKSEFIjSQYKFNLCFEHSQC3YGY 






FIGURE 13 



[Multiple sequence alignment of the nucleotide coding sequences 1111FutA, 915A.cod(MWG), 19C2FutA.cod, 26695A.cod, 1182B, 1218B.nuc and ORF19C2B, with a consensus row, covering alignment positions 1 to 1483. The aligned rows are not legible in the scan; the individual sequences are listed in the preceding figures.] 
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Batch Number   Resin Type                                        Total Yield   Actual Percent Recovery 
1-02           MR3 NH4HCO3 column (1 ml resin/1 ml synthesis)    1.567 g       [illegible] 
2-02           MR3 NH4HCO3 column (1 ml resin/1 ml synthesis)    1.760 g       [illegible] 
3-02           Dowex 1/Dowex 50 (2 ml resin/1 ml synthesis)      1.221 g       [illegible] 
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Transferrin SA [mass spectrum; x-axis m/z, 1500 to 3500] 

Transferrin AS 1182B [mass spectrum; x-axis m/z, 1250 to 3250] 
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